FIGURE l 



GGGGCTTCGGCGCCAGCGGCCAGCGCTAGTCGGTCTGGTAAGGATTTACAAAAGGTGCAGGTATG 
AGCAGGTCTGAAGACTAACATTTTGTGAAGTTGTAAAACAGAAAACCTGTTAGA AATG TGGTGGT 
TTCAGCAAGGCCTCAGTTTCCTTCCTTCAGCCCTTGTAATTTGGACATCTGCTGCTTTCATATTT 
T CATACAT TACTGCAGT AACAC T CCACCATATAGACCCGGCT T TACCT TAT AT CAGTGACACT GG 
TACAGTAGCTCCAGAAAAATGCTTAT T TGGGGCAATGCTAAATAT TGCGGCAGTTTTATGCATTG 
^ CTACCATTTATGTTCGTTATAAGCAAGTTCATGCTCTGAGTCCTGAAGAGAACGTTATCATCAAA 
#*\ TTAAACAAGGCTGGCCTTGTACTTGGAATACTGAGTTGTTTAGGACTTTCTATTGTGGCAAACTT 
fl CCAGAAAACAACCCTTTTTGCTGCACATGTAAGTGGAGCTGTGCTTACCTTTGGTATGGGCTCAT 
TATATATGT T TG T T CAGACCATCC TT TCCTACCAAAT GCAGCCCAAAAT CCATGGCAAACAAGTC 
TTCTGGATCAGACTGTTGTTGGTTATCTGGTGTGGAGTAAGTGCACTTAGCATGCTGACTTGCTC 
ATCAGTTTTGCACAGTGGCAATTTTGGGACTGATTTAGAACAGAAACTCCATTGGAACCCCGAGG 
ACAAAGGTTATGTGCTTCACATGATCACTACTGCAGCAGAATGGTCTATGTCATTTTCCTTCTTT 
GGTTTT TT CCTGACT TACAT T CGT GAT T T TCAGAAAAT T T CT T TACGGGTGGAAGCCAAT TT ACA 
TGGATTAACCCTCTATGACACTGCACCTTGCCCTATTAACAATGAACGAACACGGCTACTTTCCA 
GAGATAT T TGATGAAAGGATAAAATAT TTC T GTAATGATT AT GAT TCTCAGGGAT TGGGGAAAGG 
TTCACAGAAGT T GCT TATT C T TC TC TGAAAT T T TCAACCACT TAATCAAGGCTGACAGT AACAC T 
GAT GAATGCTGATAATCAGGAAACATGAAAGAAGCCAT T TGATAGATTAT TC TAAAGGATAT CAT 
f|| CAAGAAGACTATTAAAAACACCTATGCCTATACTT TT TTATCTCAGAAAATAAAGTCAAAAGACT 

ATG 



fit 
.... ., 
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FIGURE 2 



<subunit 1 of 1, 266 aa, 1 stop 
<MW: 29766, pi: 8.39, NX{S/T): 0 

MWWFQQGLSFLPSALVIWTSAAFIFSYITAVTLHHIDPALPYISDTGTVAPEKCLFGAMLNIAAV 
LCIATIYVRYKQVHALSPEEOTIIKLNKAGLV^ 

MGSLYMFVQT I LS YQMQPKIHGKQVFW IRLLLVI WCGVSALSMLTCS SVLHS GNFGTDLEQKLHW 
N P E DKG YVLHM I T T AAE WSMS F S F FG FFL T Y I RD FQK I S LRVEANLHG L TL YDTAPC P I NNE RT R 
LLSRDI 



Important features: 

Type II transmembrane domain: 




amino acids 13-33 



Other Transmembrane domains: 



amino acids 54-73, 94-113, 160-180, 122-141 



N-myristoylation sites. 

amino acids 57-63, 95-101, 99-105, 124-130, 183-189 
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FIGURE a 

CGGACGCGTGGGCGGACGCGTGGGGGAGAGCCGCAGTCCCGGCTGCAGCACCTGGGAGAAGGCAGACC 
GTGTGAGGGGGCCTGTGGCCCCAGCGTGCTGTGGCCTCGGGGAGTGGGAAGTGGAGGCAGGAGCCTTC 
CTTACACTTCGCCATGAGTTTCCTCATCGACTCCAGCATCATGATTACCTCCCAGATACTATTTTTTG 
GATTTGGGTGGCTTTTCTTCATGCGCCAATTGTTTAAAGACTATGAGATACGTCAGTATGTTGTACAG 
GTGATCTTCTCCGTGACGTTTGCATTTTCTTGCACCATGTTTGAGCTCATCATCTTTGAAATCTTAGG 
AGTATTGAATAGCAGCTCCCGTTATTTTCACTGGAAAATGAACCTGTGTGTAATTCTGCTGATCCTGG 
TTTTCATGGTGCCTTTTTACATTGGCTATTTTATTGTGAGCAATATCCGACTACTGCATAAACAACGA 
PI CTGCTTTTTTCCTGTCTCTTATGGCTGACCTTTATGTATTTCTTCTGGAAACTAGGAGATCCCTTTCC 
ri CATTCTCAGCCCAAAACATGGGATCTTATCCATAGAACAGCTCATCAGCCGGGTTGGTGTGATTGGAG 
f|! TGACTCTCATGGCTCTTCTTTCTGGATTTGGTGCTGTCAACTGCCCATACACTTACATGTCTTACTTC 
pj CTCAGGAATGTGACT GACACGGATATTCTAGCCCTGGAACGGCGACT GCTGCAAACCATGGATATGAT 

^ 1 CATAAGCAAAAAGAAAAGGAT GGCAATGGCACGGAGAACAATGTT CCAGAAGGGGGAAGT GCATAACA 

AACCATCAGGT TTCT GGGGAATGATAAAAAGT GTTACCACTTCAGCATCAGGAAGTGAAAATCT TACT 

if! 

CTTATTCAACAGGAAGTGGAT GCTTTGGAAGAATTAAGCAGGCAGCTTTTT CT GGAAACAGCT GATCT 

3 

m ATATGCTACCAAGGAGAGAATAGAATACT CCAAAACCTTCAAGGGGAAATAT T TTAATTTT CTTGGTT 

||1 ACTTTTTCTCTATTTACTGTGTTTGGAAAATTTTCATGGCTACCATCAATATTGTTTTTGATCGAGTT 
O GGGAAAACGGAT CCT GTCACAAGAGGCATTGAGAT CACT GTGAATTAT CTGGGAATCCAATTTGATGT 

fll GAAGT TTTGGT CCCAACACAT TTCCTTCATT CTTGTTGGAATAATCAT CGT CACATCCAT CAGAGGAT 

TGCTGATCACTCTTACCAAGTTCTTTTATGCCATCTCTAGCAGTAAGTCCTCCAATGTCATTGTCCTG 
*** CTATTAGCACAGATAATGGGCATGTACTTTGTCTCCTCTGTGCTGCTGATCCGAATGAGTATGCCTTT 
AGAATACCGCACCATAATCACTGAAGTCCTTGGAGAACTGCAGTTCAACTTCTATCACCGTTGGTTTG 
ATGTGATCTTCCTGGTCAGCGCTCTCTCTAGCATACTCTTCCTCTATTTGGCTCACAAACAGGCACCA 
GAGAAGCAAATGGCACCTTGAACTTAAGCCTACTACAGACTGTTAGAGGCCAGTGGTTTCAAAATTTA 
GATATAAGAGGGGGGAAAAAT GGAACCAGGGCCTGACATT TTATAAACAAACAAAAT GCTATGGTAGC 
ATTTTTCACCTTCATAGCATACTCCTTCCCCGTCAGGTGATACTATGACCATGAGTAGCATCAGCCAG 
AACAT GAGAGGGAGAACTA&CTCAAGACAATACTCAGCAGAGAGCAT CCCGT GTGGATAT GAGGCTGG 
TGTAGAGGCGGAGAGGAGCCAAGAAACTAAAGGTGAAAAATACACTGGAACTCTGGGGCAAGACATGT 
CTATGGTAGCTGAGCCAAACACGTAGGATTTCCGTTTTAAGGTTCACATGGAAAAGGTTATAGCTTTG 
CCTTGAGATTGACTCATTAAAATCAGAGACTGTAACAAAAAAAAAAAAAAAAAAAAAGGGCGGCCGCG 
ACTCTAGAGTCGACCTGCAGAAGCTTGGCCGCCATGGCCCAACTTGTTTATTGCAGCTTATAATG 
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FIGURE A 

MSFLIDSSIMITSQILFFGFGWLFFMRQLFKDYEIRQYWQVI FSVTFAFSCTMFELI I FEILGV 
LNSSSRYFHWKMNLCVILLILVFMVPFYIGYFIVSNIRLLHKQRLLFSCLLWLTFMYFFWKLGDP 
FPILSPKHGILSIEQL1SRVGVIGVTLMALLSGFGAVNCPYTYMSYFLRNVTDTDILALERRLLQ 
TMDMIISKKKRMAMARRTMFQKGEVHNKPSGFWGMIKSVTTSASGSENLTLIQQEVDALEELSRQ 
LFLETADLYATKERIEYSKTFKGKYFNFLGYFFS IYCVWKI FMATINIVFDRVGKTDPVTRGIEI 
TVNYLGIQFDVKFWSQHISFILVGIIIVTSIRGLLITLTKFFYAISSSKSSNVIVLLLAQIMGMY 
FVS SVLL IRMSMPLE YRT 1 1 TE VLGELQFNFYHRWFDVI FLVSALS S IL FLYLAHKQAPEKQMAP 

Important features: 
Signal peptide: 

amino acids 1-23 

Potential transmembrane domains: 

amino acids 37-55, 81-102, 150-168, 288-311, 338-356, 375-398, 
425-444 

N-glycosylation sites. 

amino acids 67-70, 180-183 and 243-246 

Eukaryotic cobalamin -binding proteins 

amino acids 151-160 
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FIGURE g 



AGCAGGGAAATCCGGATGTCTCGGTTATGAAGTGGAGCAGTGAGTGTGAGCCTCAACATAGTTCC 
AGAACTCTCCATCCGGACTAGTTATTGAGCATCTGCCTCTCATATCACCAGTGGCCATCTGAGGT 
GTTTCCCTGGCTCT GAAGGGG T AGGC AC GATGGC CAGG T GC T T CAGC C T GGT GT T GC TT C T CAC T 
TCCATCTGGACCACGAGGCTCCTGGTCCAAGGCTCTTTGCGTGCAGAAGAGCTTTCCATCCAGGT 
. GT CATGCAGAAT TATGGGGAT CAC C C T TG T GAGCAAAAAGGCGAACCAGCAGCTGAATT T CACAG 
AAGCTAAGGAGGCCTGTAGGCTGCTGGGACTAAGTTTGGCCGGCAAGGACCAAGTTGAAACAGCC 
TTGAAAGCTAGCTTTGAAACTTGCAGCTATGGCTGGGTTGGAGATGGATTCGTGGTCATCTCTAG 
GATTAGCCCAAACCCCAAGTGTGGGAAAAATGGGGTGGGTGTCCTGATTTGGAAGGTTCCAGTGA 
GCCGACAGT T T GCAGC CT AT TGT TACAACT CATCT GATACT T GGAC TAACTC GTGCATT CCAGAA 
i , AT TATCACCACCAAAGAT CCCATAT TCAACAC TCAAAC T GCAACACAAACAACAGAATT TAT TGT 

J* CAGTGACAGTACCTACTCGGTGGCATCCCC T TAG T CTACAATACCTGCCCCTAC TAC TACTCC TC 

y CT GCTCCAGC T TCCACTT CTATTC CACGGAGAAAAAAAT TGAT T TGTGTCACAGAAGTT TT TAT G 

Q GAAACTAGCACCATGT CTACAGAAACTGAACCATTTGTTGAAAATAAAGCAGCATTCAAGAAT GA 

||1 AGCTGCTGGGTTTGGAGGTGTCCCCACGGCTCTGCTAGTGCTTGCTCTCCTCTTCTTTGGTGCTG 
f a 1 CAGCTGGTCTTGGATTTTGCTATGTCAAAAGGTATGTGAAGGCCTTCCCTTTTACAAACAAGAAT 
CAGCAGAAGGAAATGATCGAAACCAAAGTAGTAAAGGAGGAGAAGGCCAATGATAGCAACCCTAA 
T GAG G AAT C AAAGAAAAC T G AT AAAAAC C CAG AAG AG T C C AAG AG T C C AAG C AAAAC TAC CGTGC 
P GATGCCT GGAAGCTGAAGTT TAGATGAGACAGAAATGAGGAGACACACCTGAGGCTGGTTTCT TT 

%y CATGC TCCT TACCC TGCCCCAGC T GGGGAAATCAAAAGGGCCAAAGAACCAAAGAAGAAAGTCCA 

CCCTTGGTTCCTAACTGGAATCAGCTCAGGACTGCCATTGGACTATGGAGTGCACCAAAGAGAAT 
^1 GCCCTTCTCCTTATTGTAACCCTGTCTGGATCCTATCCTCCTACCTCCAAAGCTTCCCACGGCCT 
TTCTAGCCTGGCTATGTCCTAATAATATCCCACTGGGAGAAAGGAGTTTTGCAAAGTGCAAGGAC 
Wl CTAAAACATCTCATCAGTATCCAGTGGTAAAAAGGCCTCCTGGCTGTCTGAGGQTAGGTGGGTTG 
Q AAAGCCAAGGAGTCAC TGAGACCAAGGCT T T C TC TACTGAT T CCGCAGCT CAGACCCTT TCTT CA 

f|i GC TCTGAAAGAGAAACACGTATCCCAC CTGACATGTCC T TCT GAGCCCGGTAAGAGCAAAAGAAT 

GGCAGAAAAGTTTAGCCCCTGAAAGCCATGGAGATTCTCATAACTTGAGACCTAATCTCTGTAAA 
W GC TAAAATAAAGAAATAGAACAAGGC T GAGGATACGACAGTACACTGT CAGCAGGGACTGTAAAC 

ill ACAGACAGGGTCAAAGTGTT TTCTCTGAACACATTGAGT TGGAATCACTGTT TAGAACACACACA 

CTTACTTTTTCTGGTCTCTACCACTGCTGATATTTTCTCTAGGAAATATACTTTTACAAGTAACA 
AAAATAAAAAC TCT TATAAATT T C TAT TTT TATCTGAGT TACAGAAATGATTAC TAAGGAAGAT T 
AC TCAGTAAT TTGT TT AAAAAGTAATAAAAT TCAACAAACAT T TGC TGAATAGC TAC TATATGTC 
AAGTGCTGTGCAAGGT AT TACACTCTGTAAT TGAATATTAT TCCTCAAAAAATTGCACATAGTAG 
AACGC TATC TGGGAAGCT AT T T T T T TCAGT T T TGATAT T TCTAGCT TAT CTAC T TCCAAAC TAAT 
T TT TATT TT T GCTGAGACTAATCT TAT TCAT T T TC TC TAATATGGCAACCAT TATAACCT T AAT T 
TAT TAT TAACATACCTAAGAAGTACAT TGT TACCTCTATATACCAAAGCACATT TTAAAAGTGCC 
AT TAACAAATGTATCACTAGC CCTCCTTTT T CCAACAAGAAGGGAC TGAGAGATGCAGAAATAT T 
TGTGACAAAAAAT TAAAGCAT T TAGAAAACT T 



APP ID=10063569 



Page 157 of 320 



FIGURE 6 



MZ\RCFSLVLLLTSIWTTRLLVQGSLRAEELSIQVSCRIMGITLVSKKANQQLNFTEAKEACRLLG 
LSLAGKDQVETALKASFETCSYGWGDGFWISRISPNPKCGKNGVGVLIWKVPVSRQFAAYCYN 
SSDTWTNSCIPEIITTKDPIENTQTATQTTEFIVSDSTYSVASPYSTIPAPTTTPPAPASTSIPR 
RKKL I CVTE VFME T S TMS TE TE P FVENKAAFKNEAAG FGGVP TALLVLALL FFGAAAGLG FC YVK 
RYVKAFPFTNKNQQKEMIETKVVKEEKANDSNPNEESKKTDKNPEESKSPSKTTVRCLEAEV 



Signal sequence: 

amino acids 1-16 



§,g Transmembrane domain: 

111 amino acids 235-254 



N-glycosylation site. 

amino acids 53-57, 130-134, 289-293 

Casein kinase II phosphorylation site* 

amino acids 145-149, 214-218 

Tyrosine kinase phosphorylation site. 

amino acids 79-88 

N-myristoylation site. 

amino acids 23-29, 65-71, 234-240, 235-239, 249-255, 253-259 
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FIGURE 7 



CGCCGCGCTCCCGCACCCGCGGCCCGCCCACCGCGCCGCTCCCGCATCTGCACCCGCAGCCCGGC 
GGCCTCCCGGCGGGAGCGAGCAGATCCAGTCCGGCCCGCAGCGCAACTCGGTCCAGTCGGGGCGG 
CGGCTGCGGGCGCAGAGCGGAGATGCAGCGGCTTGGGGCCACCCTGCTGTGCCTGCTGCTGGCGG 
CGGCGGTCCGCACGGCCCCCGCGCCCGCTCCGACGGCGACCTCGGCTCCAGTCAAGCCCGGCCCG 
GC TC TCAGCTACCCGCAGGAGGAGGCCACCC TCAATGAGAT GT TCCGCGAGGT TGAGGAACT GAT 
GGAGGACACGCAGCACAAATTGCGCAGCGCGGTGGAAGAGATGGAGGCAGAAGAAGCTGCTGCTA 
AAGCAT CATCAGAAGT GAACC TGGCAAACTTACCT CCCAGC TATCACAATGAGACCAACACAGAC 
ACGAAGGTTGGAAATAATACCATCCAT GTGCACCGAGAAAT TCACAAGATAACCAACAACCAGAC 
l ml TGGACAAATGGTC TT TTCAGAGACAGTTATCACATCTGTGGGAGACGAAGAAGGCAGAAGGAGCC 

%l ACGAGTGCATCATCGACGAGGACTGTGGGCCCAGCATGTACTGCCAGTTTGCCAGCTTCCAGTAC 
J;J ACCTGCCAGCCATGCCGGGGCCAGAGGATGCTCTGCACCCGGGACAGTGAGTGCTGTGGAGACCA 
|j:f GCTGTGTGTCTGGGGTCACTGCACCAAAATGGCCACCAGGGGCAGCAATGGGACCATCTGTGACA 
y\ ACCAGAGGGACTGCCAGCCGGGGCTGTGCTGTGCCTTCCAGAGAGGCCTGCTGTTCCCTGTGTGC 
%%l ACACCCCTGCCCGTGGAGGGCGAGCTTTGCCATGACCCCGCCAGCCGGCTTCTGGACCTCATCAC 
Iff CTGGGAGCTAGAGCCTGATGGAGCCTTGGACCGATGCCCTTGTGCCAGTGGCCTCCTCTGCCAGC 
|f| CCCACAGCCACAGCCTGGTGTATGTGTGCAAGCCGACCTTCGTGGGGAGCCGTGACCAAGATGGG 
*R GAGATCCTGCTGCCCAGAGAGGTCCCCGATGAGTATGAAGTTGGCAGCTTCATGGAGGAGGTGCG 
CCAGGAGCTGGAGGACCTGGAGAGGAGCCTGACTGAAGAGATGGCGCTGGGGGAGCCTGCGGCTG 
^ CCGCCGCTGCACTGCTGGGAGGGGAAGAGATTTAGATCTGGACCAGGCTGTGGGTAGATGTGCAA 
T AGAAAT AGC TAAT T TAT T T CCC CAGG T GT G T GC T TT AGGC G T GGGC T GAC CAGGC T T C T T C C TA 
CATCTTCTTCCCAGTAAGTTTCCCCTCTGGCTTGACAGCATGAGGTGTTGTGCATTTGTTCAGCT 
CI CCCCCAGGCTGTTCTCCAGGCTTCACAGTCTGGTGCTTGGGAGAGTCAGGCAGGGTTAAACTGCA 
HI GGAGCAGTTTGCCACCCCTGTCCAGATTATTGGCTGCTTTGCCTCTACCAGTTGGCAGACAGCCG 
|J| T T TGT T CTACATGGCT T T GATAAT TGT T T GAGGGGAGGAGAT GGAAACAATGTGGAGTCTCCCTC 

f|| TGATTGGTTTTGGGGAAATGTGGAGAAGAGTGCCCTGCTTTGCAAACATCAACCTGGCAAAAATG 
CAACAAATGAATTTTCCACGCAGTTCTTTCCATGGGCATAGGTAAGCTGTGCCTTCAGCTGTTGC 
AGATGAAATGTTCTGTTCACCCTGCATTACATGTGTTTATTCATCCAGCAGTGTTGCTCAGCTCC 
TACCTCTGTGCCAGGGCAGCATTTTCATATCCAAGATCAATTCCCTCTCTCAGCACAGCCTGGGG 
AGGGGGTCATTGTTCTCCTCGTCCATCAGGGATCTCAGAGGCTCAGAGACTGCAAGCTGCTTGCC 
CAAGTCACACAGCTAGTGAAGACCAGAGCAGT TT CAT CTGGT TGT GACT CTAAGCTCAGT GCT C T 
C TCCAC TACCCCACACCAGCC TT GGTGCCACCAAAAGT GCT CCCCAAAAGGAAGGAGAAT GGGAT 
T T T TCT TGAGGCATGCACAT C TGGAAT T AAGGTCAAAC TAAT T CTCACATCCC TC TAAAAGTAAA 
CTACTGT TAGGAACAGCAGT G TT CT CACAGTGTGGGGCAGC CGTCC T TC TAAT GAAGACAAT GAT 
AT TGACACT GTCCC TC T TT GGCAGT TGCAT TAGTAAC T TT GAAAGGTAT AT GACTGAGCGTAGCA 
TACAGGTTAACC TGCAGAAACAGTACT TAGGTAATTGTAGGGCGAGGAT TATAAATGAAATTTGC 
AAAAT CAC T TAGCAGCAAC T GAAGACAAT TAT CAAC CACGT G GAGAAAAT C AAAC C GAGCAG GGC 
TGTGTGAAACATGGTTGTAATATGCGACTGCGAACACTGAACTCTACGCCACTCCACAAATGATG 
TT T TCAGGTGTCAT GGACTGT TGCCACCAT GTAT TCAT CCAGAGT TCT TAAAGT T TAAAGT T GCA 
CATGAT T GTATAAGCAT GCT T TC T T TGAGT T T TAAAT T ATGTATAAACAT AAGT T GCAT T TAGAA 
ATCAAGCATAAAT CAC T T CAAC T GCAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 8 



MQRLGATLLCLLLAAAVPTAPAPAPTATSAPVKPGPALSYPQEEATLNEMFREVEELMEDTQHKL 
RSAVEEMEAEEAAAKASSEWIANLPPSYHNETNTDTKVGNNTIHVHREIHKITNNQTGQMVFSE 
TVI TSVGDEEGRRSHECI IDEDCGPSMYCQFAS FQYTCQPCRGQKMLCTRDSECCGDQLCVWGHC 
TKMATRGSNGTICDNQRDCQPGLCCAFQRGLLFPVCTPLPVEGELCHDPASRLLDLITWELEPDG 
ALDRCPCASGLLCQPHSHS LVYVCKP T FVGSRDQDGE I LLPREVPDE YEVGS FMEEVRQELEDLE 
RSLTEEMALGEPAAAAAALLGGEE I 



Q Signal sequence: 

amino acids 1-19 

jffl 

w 

N-glycosylation site. 

m amino acids 96-100, 106-110, 121-125, 204-208 

s Casein kinase II phosphorylation site. 

IT! 

amino acids 46-50, 67-71, 98-102, 135-139, 206-210, 312-316, 

Ul. 

is^a 327-331 
Hi 

CJ N-myristoylation site. 

W amino acids 202-208, 217-223 

Ami da ti on site. 

amino acids 140-144 
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FIGURE Q 

CGGACGCGTGGGCGGACGCGTGGGGGCTGTGAGAAAGTGCCAATAAATACATCATGCAACCCCAC 
GGCCCACCTTGTGAACTCCTCGTGCCCAGGGCTGATGTGCGTCTTCCAGGGCTACTCATCCAAAG 
GCCTAATCCAACGTTCTGTCTTCAATCTGCAAATCTATGGGGTCCTGGGGCTCTTCTGGACCCTT 
AACTGGGTACTGGCCCTGGGCCAATGCGTCCTCGCTGGAGCCTTTGCCTCCTTCTACTGGGCCTT 
CCACAAGCCCCAGGACATCCCTACCTTCCCCTTAATCTCTGCCTTCATCCGCACACTCCGTTACC 
ACACTGGGTCATTGGCATTTGGAGCCCTCATCCTGACCCTTGTGCAGATAGCCCGGGTCATCTTG 
GAG TAT AT T GACCACAAGCT CAGAGGAGTGCAGAACCC T GTAGCCCGCT GCAT CAT GTGCTGT T T 
CAAGTGCTGCCTCTGGTGTCTGGAAAAATTTATCAAGTTCCTAAACCGCAATGCATACATCATGA 
TCGCCATCTACGGGAAGAATTTCTGTGTCTCAGCCAAAAATGCGTTCATGCTACTCATGCGAAAC 
ATTGTCAGGGTGGTCGTCCTGGACAAAGTCACAGACCTGCTGCTGTTCTTTGGGAAGCTGCTGGT 
GGTCGGAGGCGTGGGGGTCCTGTCCTTCTTTTTTTTCTCCGGTCGCATCCCGGGGCTGGGTAAAG 
ACTTTAAGAGCCCCCACCTCAACTATTACTGGCTGCCCATCATGACCTCCATCCTGGGGGCCTAT 
GTCATCGCCAGCGGCTTCTTCAGCGTTTTCGGCATGTGTGTGGACACGCTCTTCCTCTGCTTCCT 
GGAAGACCTGGAGCGGAACAACGGCTCCCTGGACCGGCCCTACTACATGTCCAAGAGCCTTCTAA 
AGATTC T GGGCAAGAAGAACGAGGCGCCCCCGGACAACAAGAAGAGGAAGAAGTGACAGCT CCGG 
CCCTGATCCAGGACTGCACCCCACCCCCACCGTCCAGCCATCCAACCTCACTTCGCCTTACAGGT 
CTCCATTTTGTGGTAAAAAAAGGTTTTAGGCCAGGCGCCGTGGCTCACGCCTGTAATCCAACACT 
TTGAGAGGCTGAGGCGGGCGGATCACCTGAGTCAGGAGTTCGAGACCAGCCTGGCCAACATGGTG 
AAACC T CCGT C TC TAT TAAAAATACAAAAAT TAGCCGAGAGTGGT GGCATGCACC TGTCATCCCA 
GC TACT CGGGAGGC TGAGGCAGGAGAAT GGCT TGAACCCGGGAGGCAGAGGT TGCAGTGAGCCGA 
GAT CGCGCCACTGCACT CCAACCT GGGT GACAGACTCTGTC TCCAAAACAAAACAAACAAACAAA 
AAG AT T T TAT T AAAGAT AT T T T GT T AAC T C 
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FIGURE IP 



RTRGRTRGGCEKVPINTSCNPTAHLVNSSCPGLMCVFQGYSSKGLIQR5VFNLQIYGVLGLFWTL 
NWVLALGQCVLAGAFAS FYWAFHKPQD I PT FPL I S AFIRT LRYHTGS LAFGAL I LTLVQ I ARVI L 
EYIDHKLRGVQNPVARCIMCCFKCCLWCLEKFIKFLNRNAYIMIAIYGKNFCVSAKNAFMLLMRN 
IVRVWLDKVTDLLLFFGKLLWGGVGVLSFFFFSGRI PGLGKDFKS PHLNYYWLP IMTS ILGAY 
VIASGFFSVFGMCVDTLFLCFLEDLERNNGSLDRPYYMSKSLLKILGKKNEAPPDNKKRKK 

Important features: 
Transmembrane domains: 

amino acids 57-80 (type II), 110-126, 215-231, 254-274 



111 N-glycosylation sites. 

*il amino acids 16-20, 27-31, 289-293 

w 

f'| Hypothetical YBR002c family proteins, 

||! amino acids 276-288 

CI 

pi! if 

Ammonium transporters proteins . 

mi amino acids 204-231 

III 

N-myristoylation sites. 

amino acids 60-66, 78-84 

Amidation site. 

amino acids 306-310 
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FIGURE 11 

GCCCCGCGCCCGGCGCCGGGCGCCCGAAGCCGGGAGCCACCGCCATGGGGGCCTGCCTGGGAGCCTGC 
TCCCTGCTCAGCTGCGCGTCCTGCCTCTGCGGCTCTGCCCCCTGCATCCTGTGCAGCTGCTGCCCCGC 
CAGCCGCAACTCCACCGTGAGCCGCCTCATCTTCACGTTCTTCCTCTTCCTGGGGGTGCTGGTGTCCA 
TCATTATGCTGAGCCCGGGCGTGGAGAGTCAGCTCTACAAGCTGCCCTGGGTGTGTGAGGAGGGGGCC 
GGGATCCCCACCGTCCTGCAGGGCCACATCGACTGTGGCTCCCTGCTTGGCTACCGCGCTGTCTACCG 
CATGTGCTTCGCCACGGCGGCCTTCTTCTTCTTCTTTTTCACCCTGCTCATGCTCTGCGTGAGCAGCA 
GCCGGGACCCCCGGGCTGCCATCCAGAATGGGTTTTGGTTCTTTAAGTTCCTGATCCTGGTGGGCCTC 
ACCGTGGGTGCCTTCTACATCCCTGACGGCTCCTTCACCAACATCTGGTTCTACTTCGGCGTCGTGGG 
CTCCTTCCTCTTCATCCTCATCCAGCTGGTGCTGCTCATCGACTTTGCGCACTCCTGGAACCAGCGGT 
GGCTGGGCAAGGCCGAGGAGTGCGATTCCCGTGCCTGGTACGCAGGCCTCTTCTTCTTCACTCTCCTC 
TTCTACTTGCTGTCGATCGCGGCCGTGGCGCTGATGTTCATGTACTACACTGAGCCCAGCGGCTGCCA 
CGAGGGCAAGGTCTTCATCAGCCTCAACCTCACCTTCTGTGTCTGCGTGTCCATCGCTGCTGTCCTGC 
CCAAGGTCCAGGACGCCCAGCCCAACTCGGGTCTGCTGCAGGCCTCGGTCATCACCCTCTACACCATG 
TTTGTCACCTGGTCAGCCCTATCCAGTATCCCTGAACAGAAATGCAAGCCCCATTTGCCAACCCAGCT 
GGGCAACGAGACAGTTGTGGCAGGCCCCGAGGGCTATGAGACCCAGTGGTGGGATGCCCCGAGCATTG 
TGGGCCTCATCATCTTCCTCCTGTGCACCCTCTTCATCAGTCTGCGCTCCTCAGACCACCGGCAGGTG 
AACAGCCT GATGCAGACCGAGGAGT GCC CACCTATGCTAGACGCCACACAGCAGCAGCAGCAGCAGGT 
GGCAGCCTGTGAGGGCCGGGCCTTTGACAACGAGCAGGACGGCGTCACCTACAGCTACTCCTTCTTCC 
ACTTCTGCCTGGTGCTGGCCTCACTGCACGTCATGATGACGCTCACCAACTGGTACAAGCCCGGTGAG 
ACCCGGAAGATGATCAGCACGTGGACCGCCGTGTGGGTGAAGATCTGTGCCAGCTGGGCAGGGCTGCT 
CCTCTACCTGTGGACCCTGGTAGCCCCACTCCTCCTGCGCAACCGCGACTTCAGCTGAGGCAGCCTCA 
CAGCCTGCCATCTGGTGCCTCCTGCCACCTGGTGCCTCTCGGCTCGGTGACAGCCAACCTGCCCCCTC 
CCCACACCAATCAGCCAGGCTGAGCCCCCACCCCTGCCCCAGCTCCAGGACCTGCCCCTGAGCCGGGC 
CTTCTAGTCGTAGTGCCTTCAGGGTCCGAGGAGCATCAGGCTCCTGCAGAGCCCCATCCCCCCGCCAC 
ACCCACACGGTGGAGCTGCCTCTTCCTTCCCCTCCTCCCTGTTGCCCATACTCAGCATCTCGGATGAA 
AGGGCTCCCTTGTCCTCAGGCTCCACGGGAGCGGGGCTGCTGGAGAGAGCGGGGAACTCCCACCACAG 
TGGGGCATCCGGCACTGAAGCCCTGGTGTTCCTGGTCACGTCCCCCAGGGGACCCTGCCCCCTTCCTG 
GACT T CGT GCCT TACT GAGTCTCTAAGACTT T TTCTAATAAACAAGCCAGT GCGTGTAAAAAAAA 
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FIGURE 12 



MGACLGACSLLSCASCLCGSAPCILCSCCPASRNSTVSRLIFTFFLFLGVLVSIIMLSPGVESQL 

YKLPWCEEGAGIPTVLQGHIDCGSLLGYRAVYRMCFATAAFFFFFFTLLMLCVSSSRDPRAAIQ 

NGFWFFKFLILVGLTVGAFYIPDGSFTNIWFYFGWGSFLFILIQLVLLIDFAHSWNQRWLGKAK 

ECDSRAWYAGLFFFTLLFYLLS IAAVALMFMYYTEPSGCHEGKVFI SLNLTFCVCVS IAAVLPKV 

QDAQPNSGLLQASVITLYTMFVTWSALSSIPEQKCNPHLPTQLGNETWAGPEGYETQWWDAPSI 

VGLIIFLLCTLFISLRSSDHRQVNSLMQTEECPPMLDATQQQQQQVAACEGRAFDNEQDGVTYSY 

SFFHFCLVLASLHVMMTLTNWYKPGETRKMIST^^ 

FS 

Signal sequence: 

amino acids 1-20 

Transmembrane domains: 

amino acids 40-58, 101-116, 134-150, 162-178, 206-223, 240-257, 
272-283, 324-340, 391-406, 428-444 
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FIGURE 13 

CGGGCCAGCCTGGGGCGGCCGGCCAGGAACCACCCGTTAAGGTGTCTTCTCTTTAGGGATGGTGA 
GGTTGGAAAAAGACTCCTGTAACCCTCCTCCAGGATGAACCACCTGCCAGAAGACATGGAGAACG 
CTCTCACCGGGAGCCAGAGCTCCCATGCTTCTCTGCGCAATATCCATTCCATCAACCCCACACAA 
CTCATGGCCAGGATTGAGT CC TATGAAGGAAGGGAAAAGAAAGGCATAT C T GATGTCAGGAGGAC 
TT TCTGT TT GTT T GTCACCT T TGACC T C T TATTCGTAACAT TAC T GTGGATAATAGAGT TAAATG 
TGAATGGAGGCAT TGAGAACACAT TAGAGAAGGAGGT GATGCAGTATGAC TACTAT TCT TCATAT 
T T TGATATAT TT CT TC T GGCAGT T TT T CGATT TAAAGT GT TAATAC TT GCATATGC T GT GTGCAG 
ACTGCGCCATTGGTGGGCAATAGCGTTGACAACGGCAGTGACCAGTGCCTTTTTACTAGCAAAAG 
TGATCCTTTCGAAGCTTTTCTCTCAAGGGGCTTTTGGCTATGTGCTGCCCATCATTTCATTCATC 
CTTGCCTGGATTGAGACGTGGTTCCTGGATTTCAAAGTGTTACCTCAAGAAGCAGAAGAAGAAAA 
CAGACT CCT GATAGT T CAGGATGCT TCAGAGAGGGCAGCAC T TATACCT GGTGGTCT TT C TGAT G 
GTCAGT T TTAT TCCCCT CCTGAATCCGAAGCAGGATCTGAAGAAGC TGAAGAAAAACAGGACAGT 
GAGAAACCAC TT T TAGAACTATGAGTAC TACT TT TGT TAAAT GTGAAAAACCCTCACAGAAAGTC 
ATCGAGGCAAAAAGAGGCAGGCAGTGGAGTCTCCCTGTCGACAGTAAAGTTGAAATGGTGACGTC 
CAC TGC T GGCT T TATT GAACAGCTAATAAAGATT TATT TATT GTAATACC TCACAAACGT T GT AC 
CATATCCATGCACATTTAGTTGCCTGCCTGTGGCTGGTAAGGTAATGTCATGATTCATCCTCTCT 
TCAGTGAGACTGAGCCTGATGTGTTAACAAATAGGTGAAGAAAGTCTTGTGCTGTATTCCTAATC 
AAAAGACTTAATATAT T GAAGTAAC AC T T T T T TAG T AAG C AAGAT AC C T T T T TAT T TCAATTCAC 
AGAATGGAAT TTTTTTGTTT CATGT CTCAGAT TTAT T T T GT AT TTCT T T T T TAACAC TCTACAT T 
TCCCTTGTTTTTTAACTCATGCACATGTGCTCTTTGTACAGTTTTAAAAAGTGTAATAAAATCTG 
ACATGTCAATGTGGCTAGTTTTATTTTTCTTGTTTTGCATTATGTGTATGGCCTGAAGTGTTGGA 
CT TGCAAAAGGGGAAGAAAGGAAT T GCGAATACATGT AAAATGTCACCAGACAT T TGTAT TAT T T 
TTATCATGAAATCATGTTTTTCTCTGATTGTTCTGAAATGTTCTAAATACTCTTATTTTGAATGC 
ACAAAATGACTTAAACCATTCATATCATGTTTCCTTTGCGTTCAGCCAATTTCAATTAAAATGAA 
C TAAAT T AAAAA 
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FIGURE id 

MNHLPEDMENALTGSQSSHASLRNIHSINPTQLMARIESYEGREKKGISDVRRTFCLFVTFDLLF 
VTLLWIIELNTOGGIENTLEKEVMQYDYYSSYFDIFLIAVFRFKVLIIAYAVCRLRHWWAIALTT 
AVTSAFLLAKVILSKLFSQGAFGYVLPIISFILAWIETWFLDFKVLPQEAEEENRLLIVQDASER 
AALIPGGLSDGQFYSPPESEAGSEEAEEKQDSEKPLLEL 

Important features of the protein: 
Signal peptide: 

amino acids 1-20 

Transmembrane domains: 

amino acids 54-72, 100-118, 130-144, 146-166 
N-myristoylation sites. 

amino acids 14-20, 78-84, 79-85, 202-208, 217-223 
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FIGURE 15 



ACTCGAACGCAGTTGCTTCGGGACCCAGGACCCCCTCGGGCCCGACCCGCCAGGAAAGACTGAGG 
CCGCGGCCTGCCCCGCCCGGCTCCCTGCGCCGCCGCCGCCTCCCGGGACAGAAGATGTGCTCCAG 
GGTCCCTCTGCTGCTGCCGCTGCTCCTGCTACTGGCCCTGGGGCCTGGGGTGCAGGGCTGCCCAT 
CCGGCTGCCAGTGCAGCCAGCCACAGACAGTCTTCTGCACTGCCCGCCAGGGGACCACGGTGCCC 
CGAGACGTGCCACCCGACACGGTGGGGCTGTACGTCTTTGAGAACGGCATCACCATGCTCGACGC 
AGGCAGCTTTGCCGGCCTGCCGGGCCTGCAGCTCCTGGACCTGTCACAGAACCAGATCGCCAGCC 
TGCCCAGCGGGGTCTTCCAGCCACTCGCCAACCTCAGCAACCTGGACCTGACGGCCAACAGGCTG 
CATGAAATCACCAATGAGACCTTCCGTGGCCTGCGGCGCCTCGAGCGCCTCTACCTGGGCAAGAA 
CCGCATCCGCCACATCCAGCCTGGTGCCTTCGACACGCTCGACCGCCTCCTGGAGCTCAAGCTGC 
( ; AGGACAACGAGCTGCGGGCACTGCCCCCGCTGCGCCTGCCCCGCCTGCTGCTGCTGGACCTCAGC 
J* CACAACAGCCTCCTGGCCCTGGAGCCCGGCATCCTGGACACTGCCAACGTGGAGGCGCTGCGGCT 
CI GGCTGGTCTGGGGCTGCAGCAGCTGGACGAGGGGCTCTTCAGCCGCTTGCGCAACCTCCACGACC 
C| TGGATGTGTCCGACAACCAGCTGGAGCGAGTGCCACCTGTGATCCGAGGCCTCCGGGGCCTGACG 
jffl CGCCTGCGGCTGGCCGGCAACACCCGCATTGCCCAGCTGCGGCCCGAGGACCTGGCCGGCCTGGC 
J* J TGCCCTGCAGGAGCTGGATGTGAGCAACCTAAGCCTGCAGGCCCTGCCTGGCGACCTCTCGGGCC 
JI;f TCTTCCCCCGCCTGCGGCTGCTGGCAGCTGCCCGCAACCCCTTCAACTGCGTGTGCCCCCTGAGC 
Wl TGGTTTGGCCCCTGGGTGCGCGAGAGCCACGTCACACTGGCCAGCCCTGAGGAGACGCGCTGCCA 
|ll CTTCCCGCCCAAGAACGCTGGCCGGCTGCTCCTGGAGCTTGACTACGCCGACTTTGGCTGCCCAG 
i^Q CCACCACCACCACAGCCACAGTGCCCACCACGAGGCCCGTGGTGCGGGAGCCCACAGCCTTGTCT 
TCTAGCTTGGCTCCTACCTGGCTTAGCCCCACAGCGCCGGCCACTGAGGCCCCCAGCCCGCCCTC 
CACTGCCCCACCGACTGTAGGGCCTGTCCCCCAGCCCCAGGACTGCCCACCGTCCACCTGCCTCA 
B ATGGGGGCACATGCCACCTGGGGACACGGCACCACCTGGCGTGCTTGTGCCCCGAAGGCTTCACG 
HI GGCCTGTACTGTGAGAGCCAGATGGGGCAGGGGACACGGCCCAGCCCTACACCAGTCACGCCGAG 
Q GCCACCACGGTCCCTGACCCTGGGCATCGAGCCGGTGAGCCCCACCTCCCTGCGCGTGGGGCTGC 
PI AGCGCTACCTCCAGGGGAGCTCCGTGCAGCTCAGGAGCCTCCGTCTCACCTATCGCAACCTATCG 
GGCCCTGATAAGCGGCTGGTGACGCTGCGACTGCCTGCCTCGCTCGCTGAGTACACGGTCACCCA 
ti GCTGCGGCCCAACGCCACTTACTCCGTCTGTGTCATGCCTTTGGGGCCCGGGCGGGTGCCGGAGG 
III GCGAGGAGGCCTGCGGGGAGGCCCATACACCCCCAGCCGTCCACTCCAACCACGCCCCAGTCACC 
CAGGCCCGCGAGGGCAACCTGCCGCTCCTCATTGCGCCCGCCCTGGCCGCGGTGCTCCTGGCCGC 
GCTGGCTGCGGTGGGGGCAGCCTACTGTGTGCGGCGGGGGCGGGCCATGGCAGCAGCGGCTCAGG 
ACAAAGGGCAGGTGGGGCCAGGGGCTGGGCCCCTGGAACTGGAGGGAGTGAAGGTCCCCTTGGAG 
CCAGGCCCGAAGGCAACAGAGGGCGGTGGAGAGGCCCTGCCCAGCGGGTCTGAGTGTGAGGTGCC 
ACTCATGGGCTTCCCAGGGCCTGGCCTCCAGTCACCCCTCCACGCAAAGCCCTACATCTAAGCCA 
GAGAGAGACAGGGCAGCTGGGGCCGGGCTCTCAGCCAGTGAGATGGCCAGCCCCCTCCTGCTGCC 
ACACCACGTAAGTTCTCAGTCCCAACCTCGGGGATGTGTGCAGACAGGGCTGTGTGACCACAGCT 
GGGCCCTGTTCCCTCTGGACGTCGGTCTCCTCATCTGTGAGATGCTGTGGCCCAGCTGACGAGCC 
CTAACGTCCCCAGAACCGAGTGCCTATGAGGACAGTGTCCGCCCTGCCCTCCGCAACGTGCAGTC 
CCTGGGCACGGCGGGCCCTGCCATGTGCTGGTAACGCATGCCTGGGTCCTGCTGGGCTCTCCCAC 
TCCAGGCGGACCCTGGGGGCCAGTGAAGGAAGCTCCCGGAAAGAGCAGAGGGAGAGCGGGTAGGC 
GGCTGT GTGAC TC TAGT CTT GGCCCCAGGAAGCGAAGGAACAAAAGAAAC TGGAAAGGAAGAT GC 
TTTAGGAACATGTTTTGCTTTTTTAAAATATATATATTTATAAGAGATCCTTTCCCATTTATTCT 
GGGAAGATGTTTT TCAAACT CAGAGACAAGGACTT TGGT TT TTGTAAGACAAACGATGATATGAA 
GGCC T TT TGTAAGAAAAAATAAAAGATGAAGT GTGAAA 
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FIGURE 16 



MCSRVPLLLPLLLLLALGPGVQGCPSGCQCSQPQTVFCTARQGTTVPRDVPPDTVGLYVFENGIT 
MLDAGSFAGLPGLQLLDLSQNQIASLPSGVFQPLANLSNLDLTANRLHEITNETFRGLRRLERLY 
LGKNRIRHIQPGAFDTLDRLLELKLQDNELRALPPLRLPRLLLLDLSHNSLLALEPGILDTANVE 
ALRLAGLGLQQLDEGLFSRLRNLHDLDVSDNQLERVPPVIRGLRGLTRLRLAGNTRIAQLRPEDL 
AGLAALQELDVSNLSLQALPGDLSGLFPRLRLIJW^PFNCVCPLSWFGPWVRESHVTLJ^PEE 
TRCHFPPKNAGRLLLELDYADFGCPAT TTTATVPT TRPWRE PTALS S SLAPTWLS PTAPATE AP 
m SPPSTAPPTVGPVPQPQDCPPSTCLNGGTCHLGTRHHLACLCPEGFTGLYCESQMGQGTRPSPTP 
Q VTPRPPRSLTLGIEPVSPTSLRVGLQRYLQGSSVQLRSLRLTYRNLSGPDKRLVTLRLPASLAEY 
Cl TVTQLRPNATYSVCVMPLGPGRVPEGEEACGEAHTPPAVHSNHAPVTQAREGNLPLL IAPALAAV 

f I LLAAIAAVGAAYCVRRGRAMAAAAQDKGQVGPGAGPLELEGVKVPLEPGPKATEGGGEALPSGSE 
m CEVPLMGFPGPGLQSPLHAKPYI 

Hi 

u] Important features: 

* „ Signal peptide: 

amino acids 1-23 
m Transmembrane domain: 

III amino acids 579-599 

Cl EGF-like domain cysteine pattern signature, 

fill 

*** amino acids 430-442 

Leucine zipper pattern, 
amino acids 197-219, 269-291 
ft-glycosylation sites. 

amino acids 101-105, 117-121, 273-277, 500-504, 528-532 
Tyrosine kinase phosphorylation sites, 
amino acids 124-131, 337-345 
N-myristoylation sites. 

amino acids 23-29, 27-33, 70-76, 142-148, 187-193, 348-354, 
594-600, 640-646 
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FIGURE 17 



si 



GCAGCGGCGAGGCGGCGGTGGTGGCTGAGTCCGTGGTGGCAGAGGCGAAGGCGACAGCTCATGCG 
GGTCCGGATAGGGCTGACGCTGCTGCTGTGTGCGGTGCTGCTGAGCTTGGCCTCGGCGTCCTCGG 
AT GAAGAAGGCAGCCAGGATGAAT CC T TAGATT CCAAGACTACTT TGACAT CAGATGAGTCAGTA 
AAGGACCATACTACTGCAGGCAGAGTAGTT GCTGGTCAAATAT TTCTT GAT T CAGAAGAAT C TGA 
AT TAGAATCCTCTAT TCAAGAAGAGGAAGACAGCCTCAAGAGCCAAGAGGGGGAAAGTGT CACAG 
AAGATATCAGCTTTCTAGAGTCTCCAAATCCAGAAAACAAGGACTATGAAGAGCCAAAGAAAGTA 
CGGAAACCAGCT T TGACCGCCAT TGAAGGCACAGCACAT GGGGAGCCCT GCCACT TCCCTTTTCT 
TT T CCTAGATAAGGAGTATGATGAATGTACATCAGAT GGGAGGGAAGAT GGCAGACT GT GGT GTG 
C TACAACCTATGAC TACAAAGCAGATGAAAAGT GGGGCTT T TGTGAAACTGAAGAAGAGGCT GCT 
AAGAGACGGCAGATGCAGGAAGCAGAAATGATGTATCAAACT GGAATGAAAATCC T TAAT GGAAG 
CAATAAGAAAAGCCAAAAAAGAGAAGCATAT CGGTAT CT CCAAAAGGCAGCAAGCATGAACCATA 



ffi CCAAAGCCC T GGAGAGAGT GTCATATGC TC T T T TATT T GGT GAT TACT T GCCACAGAATATCCAG 

^0 GCAGCGAGAGAGATGT T TGAGAAGC TGACT GAGGAAGGCTCTCCCAAGGGACAGAC TGCT CT TGG 

i5 CTTTCTGTATGCCTCTGGACTTGGTGTTAATTCAAGTCAGGCAAAGGCTCTTGTATATTATACAT 
™ TT GGAGCTCT TGGGGGCAAT C TAATAGCCCACATGGT T T TGGTAAGTAGACT T TAG TGGAAGGC T 

Pi AATAATATTAACATCAGAAGAATTTGTGGTTTATAGCGGCCACAACTTTTTCAGCTTTCATGATC 
fll CAGATTTGCTTGTATTAAGACCAAATATTCAGTTGAACTTCCTTCAAATTCTTGTTAATGGATAT 
O AACACATGGAAT C TACAT GTAAATGAAAGT T GGTGGAGTCCACAAT T T T TCT T TAAAAT GAT TAG 

W TT TGGCTGAT TGCCCCT AAA7\AGAGAGATC T GATAAAT GGC T CTT T T TAAAT T TTC TCT GAGT T G 

GAATT GTCAGAATCAT T T T T TACAT TAGAT TATCATAAT T T TAAAAAT T T TT CT TTAGT T T T T CA 
AAATT T T GTAAATGGTGGCT ATAGAAAAACAACATGAAATAT TATACAATAT T T TGCAACAATGC 
CC TAAGAAT TGT TAAAAT TCATGGAGT TAT T TGTGCAGAAT GACT CCAGAGAGCTC TACT T TCTG 
TTTTTTACTTTTCATGATTGGCTGTCTTCCCATTTATTCTGGTCATTTATTGCTAGTGACACTGT 
GCCTGCTTCCAGTAGTCTCATTTTCCCTATTTTGCTAATTTGTTACTTTTTCTTTGCTAATTTGG 
AAGATT AACT CAT T TT TAAT AAAAT TAT GTCTAAGAT TAAAAAAAAAAAAAAAAAAAAAAAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 18 



MRWIGLTLLLCAVLLSLASASSDEEGSQDESLDSKTTLTSDESVKDHTTAGRWAGQIFLDSEESEL 
ESSIQEEEDSLKSQEGESVTEDISFLESPNPENKDYEEPKKVRKPALTAIEGTAHGEPCHFPFLFLDK 
EYDECTSDGREDGRLWCATTYDYKADEKWGFCET^ 

EAYRYLQKAASMNHTKALERVSYALLFGDYLPQNIQAAREMFEKLTEEGSPKGQTALGFLYASGLGVN 
S SQAKALV Y YT FGALGGNL I AHMVLVSRL 

Important features: 
Signal peptide: 

amino acids 1-21 

N-glycosylation sites. 

amino acids 195-199, 217-221, 272-276 

Tyrosine kinase phosphorylation site. 

amino acids 220-228 

N-myristoylation sites. 

amino acids 120-126, 253-259, 268-274, 270-274, 285-291, 289-295 

Glycosaminoglycan attachment site. 

amino acids 267-271 

Microbodies C-terminal targeting signal. 

amino acids 299-303 

Type II fibronectin collagen-binding domain protein. 

amino acids 127-169 

Fructose-bisphosphate aldolase class-II protein. 

amino acids 101-119 
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FIGURE 1Q 



AATTCAGATT T TAAGCCCAT TCTGCAGT GGAAT T TCAT GAACTAGCAAGAGGACACCATCT T CTT 
GTAT TATACAAGAAAGGAGT GTACCTAT CACACACAGGGGGAAAAATGCTCT T TTGGGT GC TAGG 
CC TCCT AAT CC TC TGT GGT T T TC T GT GGACT CG TAAAGGAAAAC TAAAGAT TGAAGACATCACTG 
ATAAGTACATTTTTATCACTGGATGTGACTCGGGCTTTGGAAACTTGGCAGCCAGAACTTTTGAT 
AAAAAGGGAT T TCATGT AATCGC TGCC TGTCTGACTGAAT CAGGATCAACAGCT TTAAAGGCAGA 
AACCTCAGAGAGAC T T CGT ACTGTGC T T CT GGAT GTGACCGACCCAGAGAAT GTCAAGAGGACTG 
CCCAGTGGGTGAAGAACCAAGTTGGGGAGAAAGGTCTCTGGGGTCTGATCAATAATGCTGGTGTT 
CCCGGCGT GCTGGC TCCCAC TGAC TGGC TGACAC TAGAGGACTACAGAGAACC TAT TGAAGT GAA 
CCTGTTTGGACTCATCAGTGTGACACTAAATATGCTTCCTTTGGTCAAGAAAGCTCAAGGGAGAG 
TTATTAATGTCTCCAGTGTTGGAGGTCGCCTTGCAATCGTTGGAGGGGGCTATACTCCATCCAAA 
TATGCAGT GGAAGGT TT CAATGACAGCT TAAGACGGGACAT GAAAGCTT T T GGTGTGCACGTCTC 
ATGCAT T GAACCAGGAT TGT TCAAAACAAAC TTGGCAGATCCAGTAAAGGTAAT TGAAAAAAAAC 
TCGCCATTTGGGAGCAGCTGTCTCCAGACATCAAACAACAATATGGAGAAGGTTACATTGAAAAA 
AGT CTAGACAAACTGAAAGGCAATAAAT CC TAT GT GAACAT GGACCTCT CTCCGGTGGTAGAGT G 
CATGGACCACGCTCTAACAAGTCTCTTCCCTAAGACTCATTATGCCGCTGGAAAAGATGCCAAAA 
TT T TC T GGATACC T CT GT CTCACATGCCAGCAGC T T TGCAAGAC T T TT TAT T GTTGAAACAGAAA 
GCAGAGCTGGC TAATCCCAAGGCAGTGTGACTCAGC TAACCACAAATGT CT CC TCCAGGCTATGA 
AATTGGCCGATTTCAAGAACACATCTCCTTTTCAACCCCATTCCTTATCTGCTCCAACCTGGACT 
CATTTAGATCGTGCTTATTTGGATTGCAAAAGGGAGTCCCACCATCGCTGGTGGTATCCCAGGGT 
CCCTGCTCAAGTTTTCTTTGAAAAGGAGGGCTGGAATGGTACATCACATAGGCAAGTCCTGCCCT 
GTATTTAGGCTTTGCCTGCTTGGTGTGATGTAAGGGAAATTGAAAGACTTGCCCATTCAAAATGA 
TCTTTACCGTGGCCTGCCCCATGCTTATGGTCCCCAGCATTTACAGTAACTTGTGAATGTTAAGT 
ATCAT CTCT TAT CTAAAT AT TAAAAGATAAGTCAACCCAAAAAAAAAAAAAAAAAAAAAAAAAAA 
AAAAAAAAAAAAA 
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FIGURE 2Q 



MLFWVLGLLILCGFLWTRKGKLKIEDI TDKYIFI TGCDSGFGNLAART FDKKGFHVTAACLTESG 
STALKAETSERLRTVXLDVTDPENVKRTAQWVKN^ 

REPIEVNLFGLISVTLNMLPLVKKAQGRVINVSSVGGRLAIVGGGYTPSKYAVEGFNDSLRRDMK 
AFGVHVSCIEPGLFKTNIJUDPVKVIEKK^ 

LSPVVECMDHALTSLFPKTHYAAGKDAKIFWIPLSHMPAALQDFLLLKQKAELANPKAV 

Important features of the protein: 
Signal peptide: 

amino acids 1-17 

Transmembrane domain: 

amino acids 136-152 

N-glycosylation sites • 

amino acids 161-163, 187-190 and 253-256 

Glycosaminoglycan attachment site. 

amino acids 39-42 

N-myristoylation sites . 

amino acids 36-41, 42-47, 108-113, 166-171, 198-203 and 207-212 
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FIGURE 21 



CTGAGGCGGCGGTAGC ATG GAGGGGGAGAGTACGTCGGCGGTGCTCTCGGGCTTTGTGCTCGGCG 
CAC TCGCT T TCCAGCACCT CAACACGGACT CGGACACGGAAGGT T TTCTTCTT GGGGAAGTAAAA 
GGTGAAGCCAAGAACAGCAT TACTGAT T CCCAAATGGAT GATG T TGAAGT TGT TTATACAAT TGA 
CATTCAGAAATATATTCCATGCTATCAGCT TT TTAGCT T TTATAATTCT TCAGGCGAAGTAAATG 
AGCAAGCACT GAAGAAAATAT TAT CAAATGTCAAAAAGAAT GT GGTAGG T T GGTACAAAT TCCGT 
CGTCAT TCAGATCAGATCATGACGTTTAGAGAGAGGCTGCT TCACAAAAACTTGCAGGAGCATTT 
T T CAAACCAAGACCT T GT T T T TCT GCTATTAACACCAAGTATAATAACAGAAAGC TGCTC TACTC 
AT CGACTGGAACATTCCT TATATAAACCTCAAAAAGGACTT T T TCACAGGGTACC T T TAGTGGT T 
GCCAATCTGGGCATGTCTGAACAACTGGGTTATAAAACTGTATCAGGTTCCTGTATGTCCACTGG 
TTTTAGCCGAGCAGTACAAACACACAGCTCTAAAT T T TTTGAAGAAGATGGATCCTTAAAGGAGG 
TACATAAGATAAATGAAATGTATGC TT CAT T ACAAGAGGAAT TAAAGAGT ATATGCAAAAAAGTG 
GAAGACAGT GAACAAGCAGTAGATAAACTAGTAAAGGATGTAAACAGAT T AAAACGAGAAAT TGA 
GAAAAGGAGAGGAGCACAGATTCAGGCAGCAAGAGAGAAGAACATCCAAAAAGACCCTCAGGAGA 
ACATTTTTCTTTGTCAGGCATTACGGACCTTTTTTCGAAATTCTGAATTTCTTCATTCATGTGTT 
ATGTCTTTAAAAAATAGACATGTTTCTAAAAGTAGCTGTAACTACAACCACCATCTCGATGTAGT 
AGACAAT CT GACC T TAAT GGTAGAACACACTGACAT TCCT GAAGCTAGT CCAGC TAGTACACCAC 
AAATCATTAAGCATAAAGCCTTAGACTTAGATGACAGATGGCAATTCAAGAGATCTCGGT TGT TA 
GATACACAAGACAAACGATCTAAAGCAAATACTGGTAGTAGTAACCAAGATAAAGCATCCAAAAT 
GAGCAGCCCAGAAACAGATGAAGAAATTGAAAAGATGAAGGGT TT TGGT GAATATTCACGGTCTC 
CTACAT T T TGATCCT T T TAACC T TACAAGGAGAT T T T T T TAT T TGGC TGATGGGTAAAGCCAAAC 
ATTTCTATTGTTTTTACTATGTTGAGCTACTTGCAGTAAGTTCATTTGTTTTTACTATGTTCACC 
TGTTTGCAGT AATACACAGATAAC TCT TAGT GCAT T TACTT CACAAAGTACT T T T T CAAACATCA 
GATGCT T T TAT TT CCAAACCT T T T T TTCACCT T TCACTAAGT T GT TGAGGGGAAGGC TTACACAG 
ACACAT T CT T TAGAAT T GGAAAAGTGAGACCAGGCACAGTGGC TCACACC TGTAAT CCCAGCAC T 
TAGGGAAGACAAGTCAGGAGGAT T GAT T GAAGCTAGGAGT TAGAGACCAGCC TGGGCAACGTAT T 
GAGACCATGT C TAT TAAAAAATAAAAT GGAAAAGCAAGAAT AGCC T TAT T T TCAAAATAT GGAAA 
GAAAT T TAT ATGAAAAT T TATCTGAGT CAT TAAAAT T CT CCT TAAGTGATACT TTT T TAGAAGTA 
CAT TAT GGC TAGAGT TGCCAGATAAAAT GCT GGATAT CATGCAATAAAT T TGCAAAACAT CATC T 
AAAATTTAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 22 



MEGESTSAVLSGFVLGALAFQHLNTDSDTEGFLLGEVKGEAKNSITDSQMDDVEWYTIDIQKYI 
PC YQL FS FYNS S GE VNE QALKKI L SNVKKNWGW YKFRRHS DQ IMT FRERLLHKNLQEH FSNQDL 
VFLLLTPSIITESCSTHRLEHSLYKPQKGLFHRVPLWANLGMSEQLGYKTVSGSCMSTGFSRAV 
QTHSSKFFEEDGSLKEVHKINEMYASLQEELKSICKKVEDSEQAVDKLVKDVNRLKREIEKRRGA 
QIQAAREKNIQKDPQENIFLCQALRTFFPNSEFLHSCVMSLKNRHVSKSSCNYNHHLDWDNLTL 
MVEHTDIPEASPASTPQIIKHKALDLDDRWQFKRSRLLDTQDKRSKANTGSSNQDKASKMSSPET 
DEE IEKMKG FGE YSRSPT F 



Important features: 
Signal peptide: 

amino acids 1-19 



%} N-glycosylation sites . 

3 amino acids 75-79, 322-326 

o 

PI N-myristoylation site. 

flj amino acids 184-154 

y 

III 



Growth factor and cytokines receptors family. 

amino acids 134-150 
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FIGURE 23 



GGCACAGCC GCGCGGCGGAGGGCAGAGTCAGC CGAGCCGAGT CCAGCCGGACGAGCGGAC CAGCGCAGGGCAGC CCAA 
GCAGCGCGCAGCGAACGCCCGCCGCCGCCCACACCCTCTGCGGTCCCCGCGGCGCCTGCCACCCTTCCCTCCTTCCCC 
GCGTCCCCGCCTCGCCGGCCAGTCAGCTTGCCGGGTTCGCTGCCCCGCGAAACCCCGAGGTCACCAGCCCGCGCCTCT 
GCTTCCCTGGGCCGCGCGCCGCCTCCACGCCCTCCTTCTCCCCTGGCCCGGCGCCTGGCACCGGGGACCGTTGCCTGA 
CGCGAGGCCCAGCTCTACTTTTCGCCCCGCGTCTCCTCCGCCTGCTCGCCTCTTCCACCAACTCCAACTCCTTCTCCC 
TCCAGCTCCACTCGCTAGTCCCCGACTCCGCCAGCCCTCGGCCCGCTGCCGTAGCGCCGCTTCCCGTCCGGTCCCAAA 
GGTGGGAACGCGTCCGCCCCGGCCCGCACCATGGCACGGTTCGGCTTGCCCGCGCTTCTCTGCACCCTGGCAGTGCTC 
AGCGCCGCGCTGCTGGCTGCCGAGCTCAAGTCGAAAAGTTGCTCGGAAGTGCGACGTCTTTACGTGTCCAAAGGCTTC 
H AACAAGAACGAT GCCC C CCT CCACGAGAT CAACGGT GATCATTT GAAGATCT GT C CCCAGGGTT CTACCT GCT GCTCT 

Cl CAAGAGAT GGAGGAGAAGTACAGCCTGCAAAGTAAAGAT GATTT CAAAAGTGT GGTCAGCGAACAGTGCAATCATTT G 

Q CAAGCTGT C TTT GCTT CACGTTACAAGAAGTTTGATGAAT T CTTCAAAGAACTACTTGAAAATGCAGAGAAAT CCCT G 

4.1= AATGATATGTTT GTGAAGACATAT GGCCATTTATACAT GCAAAATTCT GAGCTATTTAAAGAT CT CTT C GTAGAGTTG 

W AAACGTTACTACGTGGTGGGAAATGTGAACCTGGAAGAAATGCTAAATGACTTCTGGGCTCGCCTCCTGGAGCGGATG 

'I? I TTCCGCCTGGTGAACTCCCAGTACC7VCTTTACAGATGAGTATCTGGAATGTGTGAGCAAGTATACGGAGCAGCTGAAG 

III 

5 CCCTTCGGAGATGTCCCTCGCAAATTGAAGCTCCAGGTTACTCGTGCTTTTGTAGCAGCCCGTACTTTCGCTCAAGGC 

. r* 

Ifc? TTAGCGGTT GCGGGAGAT GT CGTGAGCAAGGTCTCCGT GGTAAACCCCACAGCCCAGT GTACCCATGCCCTGTT GAAG 

~ AT GATCTACT GCTCCGACTGCCGGGGT CTCGT GACTGTGAAGC CATGTTACAACTACTGCTCAAACAT CATGAGAGGC 

M TGTTT GGCCAACCAAGGGGATCT CGATTTTGAAT GGAACAATTTCATAGATGCTATGCT GAT GGT GGCAGAGAGGCTA 

GAGGGT CCTT T CAACAT T GAAT CGGT CAT GGAT CCCAT CGAT GT GAAGATTT CT GATGCTATTAT GAACAT GCAGGAT 
AATAGTGTTCAAGTGTCTCAGAAGGTTTTCCAGGGATGTGGACCCCCCAAGCCCCTCCCAGCTGGACGAATTTCTCGT 
* ^ TCCAT CTCT GAAAGTGCCTT CAGT GCT CGCTT CAGACCACATCACCCC GAGGAAC GCCCAAC CACAGCAGCT GGCACT 

W AGTTT GGAC CGACT G GTTACT GAT GT CAAGGAGAAACT GAAACAGGC CAAGAAATTCT GGT CCTCCCTTCCGAGCAAC 

HI GT TT GCAACGAT GAGAGGATGGCT GCAGGAAACGGCAAT GAGGATGACTGTT GGAATG GGAAAGG CAAAAGCAGGTAC 

CT GTTTGCAGTGACAGGAAATGGATTAGCCAACCAGGGCAACAACCCAGAGGT C CAGGT T GACACCAGCAAACCAGAC 
ATACTGAT CCTTC GT CAAAT CAT GGCT CTT CGAGT GATGACCAGCAAGAT GAAGAAT GCATACAATGG GAACGACGT G 
GACT TCTTT GATAT CAGT GATGAAAGTAGTGGAGAAGGAAGT GGAAGT GGCT GT GAGT AT CAGCAGTGC C CTTCAGAG 
TTTGACTACAATGCCACT GACCAT GCT GGGAAGAGTGCCAAT GAGAAAGC CGACAGT GCT GGTGTCCGTCCT GGGGCA 
CAGGCCTACCTC CTCACT GTCT TCT GCAT CTT GTT CCT GGTTAT GCAGAGAGAGT GGAGATAATTCT CAAACT CT GAG 
AAAAAGTGTT CATCAAAAAGTTAAAAGGCACCAGTTAT CACT TTT CTAC CAT C CTAGT GACTTT GCTTT TTAAAT GAA 
TGGACAACAATGTACAGTTTTTACTATGTGGCCACTGGTTTAAGAAGTGCTGACTTTGTTTTCTCATTCAGTTTTGGG 
AGGAAAAGGGACT GT GCATTGAGTT GGTTCCT GCTCC CCCAAACCAT GTT AAACGTGGCTAACAGTGTAGGTACAGAA 
CTATAGTTAGTTGTGCATTTGTGATTTTATCACTCTATTATTTGTTTGTATGTTTTTTTCTCATTTCGTTTGTGGGTT 
TT T TTTT CCAACTGT GAT CT CGCCT TGTTTCTTACAAGCAAACCAGGGTC CCTT CT T GGCACGTAACATGTAC GTATT 
TCTGAAATATTAAATAGCTGTACAGAAGCAG GT TTTAT TTATCAT GT TAT CTTATTAAAAGAAAAAGCCCAAAAAGC 
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FIGURE 24. 



MARFGLPALLCTIAVLST^ALLAAELKSKSCSEVRRLWSKGFNKNDAPLHEINGDHLKICPQGST 
CCSQEMEEKYSLQSKDDFKSWSEQCNHLQAVFASRYKKFDEFFKELLENAEKSLNDMFVKTYGH 
LYMQNSELFKDLFVELKRYYWGNVNLEEMLNDFWARLLERMFRLVNSQYHFTDEYLECVSKYTE 
QLKPFGDVPRKLKLQVTRAFVAARTFAQGLAVAGDWSKVSVVNPTAQCTHALLKMIYCSHCRGL 
VTVKPC YNYCSN IMRGCLANQGDLDFEWNNFI DAMLMVAERLEGP FNI ESVMDP I DVKI SDAIMN 
MQDNSVQVS QKVFQGCGPPKPLPAGRI SRS I SESAFSARFRPHHPEERPTTAAGTSLDRLVTDVK 
EKLKQAKKFWSSLPSNVCNDERMAAGNGNEDDCWNGKGKSRYLFAVTGNGLANQGNNPEVQVDTS 
KPDILILRQIMALRVMTSKMKNAYNGNDVDFFDISDESSGEGSGSGCEYQQCPSEFDYNATDHAG 
KSANEKADSAGVRPGAQAYLL T VFC I L FLVMQRE WR 

Important features : 
Signal peptide: 



,?l amino acids 1-22 



O ATP/GTP-binding site motif A (P-loop) . 

Ill 



aiaino acids 515-524 



pi N-glycosylation site* 

Ill amino acids 514-518 



Glycosaminoglycan attachment sites. 

amino acids 494-498, 498-502 

N-myristoylation sites. 

amino acids 63-69, 224-230, 276-282, 438-444, 497-503, 531-537 
Glypicans proteins, 

amino acids 54-75, 105-157, 238-280, 309-346, 423-460, 468-506 
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FIGURE 25 



y 



C T CGCCCTCAAATGGGAACGCTGGCC T GGGAC TAAAGCATAGACCACCAGGCT GAGTATCCT GAC 
C TGAGTCAT CCCCAGGGAT CAGGAGCCT CCAGCAGGGAACCT TCCAT TATAT T CT TCAAGCAAC T 
TACAGCTGCACCGACAGTTGCGATGAAAGTTCTAATCTCTTCCCTCCTCCTGTTGCTGCCACTAA 
TGCTGATGTCCATGGTCTCTAGCAGCCTGAATCCAGGGGTCGCCAGAGGCCACAGGGACCGAGGC 
CAGGCTTCTAGGAGATGGCTCCAGGAAGGCGGCCAAGAATGTGAGTGCAAAGATTGGTTCCTGAG 
AGCCCCGAGAAGAAAAT TCAT GACAGT GTCT GGGC TGCCAAAGAAGCAGT GCCCCTGTGATCAT T 
TCAAGGGCAAT GT GAAGAAAACAAGACACCAAAGGCACCACAGAAAGCCAAACAAGCAT TCCAGA 
>»5 GCCTGCCAGCAATTTCTCAAACAATGTCAGCTAAGAAGCTTTGCTCTGCCTTTGTAGGAGCTCTG 
!3 AGCGCCCACT CT TCCAAT TAAACAT TCTCAGCCAAGAAGACAGTGAGCACACC TACCAGACACTC 

I] TTCTTCTCCCACCTCACTCTCCCACTGTACCCACCCCTAAATCATTCCAGTGCTCTCAAAAAGCA 

!f TGTTTTTCAAGATCATTTTGTTTGTTGCTCTCTCTAGTGTCTTCTTCTCTCGTCAGTCTTAGCCT 

P 

GTGCCCTCCCCTTACCCAGGCTTAGGCTTAATTACCTGAAAGATTCCAGGAAACTGTAGCTTCCT 
AGCTAGTGTCATTTAACCTTAAATGCAATCAGGAAAGTAGCAAACAGAAGTCAATAAATATTTTT 
AAATGTCAAAAAAAAAAAAAAAAAA 



it V * 



APP ID=10063569 



Page 177 of 320 



FIGURE 26 



MKVLISSLLLLLPLMIMSMVSSSLNPGVARGHRDRGQASRRWLQEGGQECECKDWFLRAPRRKFM 
TVSGLPKKQCPCDHFKGNVKKTRHQRHHRKPNKHSRACQQFLKQCQLRSFALPL 

Important features: 
Signal peptide: 

amino acids 1-22 

N-myristoylation sites. 

amino acids 27-33, 46-52 
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FIGURE 27 



GGACGCCAGCGCCTGCAGAGGCTGAGCAGGGAAAAAGCCAGTGCCCCAGCGGAAGCACAGCTCAG 
AGCTGGTCTGCC ATGG ACATCCTGGTCCCACTCCTGCAGCTGCTGGTGCTGCTTCTTACCCTGCC 
CCTGCACCTCATGGCTCTGCTGGGCTGCTGGCAGCCCCTGTGCAAAAGCTACTTCCCCTACCTGA 
TGGCCGTGCT GAC TCCCAAGAGCAACCGCAAGATGGAGAGCAAGAAACGGGAGCTC T TCAGCCAG 
ATAAAGGGGCTTACAGGAGCCTCCGGGAAAGTGGCCCTACTGGAGCTGGGCTGCGGAACCGGAGC 
CAACTTTCAGTTCTACCCACCGGGCTGCAGGGTCACCTGCCTAGACCCAAATCCCCACTTTGAGA 
AGTTCCTGACAAAGAGCAT GGCT GAGAACAGGCACC T CCAAT AT GAGCGG T T TGT GGTGGCT CC T 

Mi 

^ GGAGAGGACATGAGACAGCTGGCTGATGGCTCCATGGATGTGGTGGTCTGCACTCTGGTGCTGTG 
S% C T CTGT GCAGAGCCCAAGGAAGGTCCT GCAGGAGGTCCGGAGAGTACT GAGACCGGGAGGTGT GC 

|f| TCTTTTTCTGGGAGCATGTGGCAGAACCATATGGAAGCTGGGCCTTCATGTGGCAGCAAGTTTTC 
Ujj GAGCCCACCTGGAAACACAT TGGGGATGGCTGCTGCCT CACCAGAGAGACCTGGAAGGATCTT GA 

|f| GAACGCCCAGTTCTCCGAAATCCAAATGGAACGACAGCCCCCTCCCTTGAAGTGGCTACCTGTTG 
^ GGCCCCACATCATGGGAAAGGCTGTCAAACAATCTTTCCCAAGCTCCAAGGCACTCATTTGCTCC 
s T TCCCCAGCC TCCAAT TAGAACAAGCCACCCACCAGCCT AT C TATCT T CCACTGAGAGGGACC TA 

Q GCAGAATGAGAGAAGACATT CAT GTACCACC TACTAGTCCCTC TC TCCCCAACCTCTGCCAGGGC 

3 ES 

MS AATCTCTAACTTCAATCCCGCCTTCGACAGTGAAAAAGCTCTACTTCTACGCTGACCCAGGGAGG 
AAACACTAGGACCCTGTTGTATCCTCAACTGCAAGTTTCTGGACTAGTCTCCCAACGTTTGCCTC 
^' CCAATGTTGTCCCTTTCCTTCGTTCCCATGGTAAAGCTCCTCTCGCTTTCCTCCTGAGGCTACAC 
jj| CCATGCGTCTCTAGGAACTGGTCACAAAAGTCATGGTGCCTGCATCCCTGCCAAGCCCCCCTGAC 
CCTCTCTCCCCACTACCACCTTCTTCCTGAGCTGGGGGCACCAGGGAGAATCAGAGATGCTGGGG 
AT GCCAGAGCAAGACT CAAAGAGGCAGAGGT TTTGTTCT CAAATAT TT T T TAATAAATAGACGAA 
ACCACG 
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FIGURE 28 

MDILVPLLQLLVLLLTLPLHLMAL^ 

TGASGKVALLELGCGTGANFQFYPPGCRVTCLDPNPHFEKFLTKSMAENRHLQYERFW7VPGEDM 
RQLADGSMDVWCTLVLCSVQSPRKVTjQEWRVLRPGGVLFFWEHVAEPYGSW 
KHIGDGCCLTRETWKDLENAQFSEIQMERQPPPLKWLPVGPHIMGKAVKQSFPSSKALICSFPSL 
QLEQATHQP I YLPLRGT 



.5 



Important features: 
Signal peptide: 

amino acids 1-23 



iy Leucine zipper pattern. 

m 

: r:; amino acids 10-32 

61 



N-myristoylation sites. 

amino acids 64-70, 78-84, 80-86, 91-97, 201-207 
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FIGURE 2Q 



CAATGTTTGCCTATCCACCTCCCCCAAGCCCCTTTACCTATGCTGCTGCTAACGCTGCTGCTGCT 
GCTGCTGCTGCTTAAAGGCTCATGCTTGGAGTGGGGACTGGTCGGTGCCCAGAAAGTCTCTTCTG 
CCACTGACGCCCCCATCAGGGATTGGGCCTTCTTTCCCCCTTCCTTTCTGTGTCTCCTGCCTCAT 
CGGCCTGCCATGACCTGCAGCCAAGCCCAGCCCCGTGGGGAAGGGGAGAAAGTGGGGGATGGCTA 
AGAAAGCTGGGAGATAGGGAACAGAAGAGGGTAGTGGGTGGGCTAGGGGGGCTGCCTTATTTAAA 
GTGGTT GT TTATGATT CTTATACTAATT TATACAAAGATATTAAGGCCCTGTTCATTAAGAAATT 
GTTCCCTTCCCCTGTGTTCAATGTTTGTAAAGATTGTTCTGTGTAAATATGTCTTTATAATAAAC 
AGT TAAAAGCTGAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE so 

MLLLTLLLLLLLLKGSCLEWGLVGAQKVSSATDAPIRDWAFFPPSFLCLLPHRPAMTCSQAQPRG 
EGEKVGDG 

Important features: 
Signal peptide: 

amino acids 1-15 

Growth factor and cytokines receptors family: 

amino acids 3-18 
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FIGURE 31 



GT T TGAAT T CCT TCAACTATACCCACAGTCCAAAAGCAGACT CAC TGT GT CCCAGGC TACCAGTT 
CCTCCAAGCAAGTCATTTCCCTTATTTAACCGATGTGTCCCTCAAACACCTGAGTGCTACTCCCT 
AT T TGCAT CT GT T T TGAT AAAT GATGT T GACACCC TCCACCGAAT T CTAAGT GGAAT CATGT CGG 
GAAGAGATACAATCGTTGGCCTGTGTATCCTCGCATTAGCCTTGTCTTTGGCCATGATGTTTACC 
TTCAGATTCATCACCACCCTTCTGGTTCACATTTTCATTTCATTGGTTATTTTGGGATTGTTGTT 
TGTCTGCGGTGTTTTATGGTGGCTGTATTATGACTATACCAACGACCTCAGCATAGAATTGGACA 
CAGAAAGGGAAAATATGAAGTGCGTGCTGGGGTTTGCTATCGTATCCACAGGCATCACGGCAGTG 
f*\ CTGCTCGTCTTGATTTTTGTTCTCAGAAAGAGAATAAAATTGACAGTTGAGCTTTTCCAAATCAC 
Q AAATAAAGCCATCAGCAGTGCTCCCTTCCTGCTGTTCCAGCCACTGTGGACATTTGCCATCCTCA 
Ih TTTTCTTCTGGGTCCTCTGGGTGGCTGTGCTGCTGAGCCTGGGAACTGCAGGAGCTGCCCAGGTT 
W ATGGAAGGCGGCCAAGTGGAATATAAGCCCCTTTCGGGCATTCGGTACATGTGGTCGTACCATTT 

in 

Jp AATTGGCCTCATCTGGACTAGTGAATTCATCCTTGCGTGCCAGCAAATGACTATAGCTGGGGCAG 

k n TGGTTACTTGTTATTTCAACAGAAGTAAAAATGATCCTCCTGATCATCCCATCCTTTCGTCTCTC 

9 TCCATTCTCTTCTTCTACCATCAAGGAACCGTTGTGAAAGGGTCATTTTTAATCTCTGTGGTGAG 

© GAT TCCGAGAATCATT GT CATGTACAT GCAAAACGCACT GAAAGAACAGCAGCATGGTGCATT GT 

s s$ 

fj CCAGGTACCTGTTCCGATGCTGCTACTGCTGTTTCTGGTGTCTTGACAAATACCTGCTCCATCTC 
Kl AACCAGAAT GCATATACTACAAC TGCTAT TAATGGGACAGAT T T C T GTACATCAGCAAAAGAT GC 

m 

in AT T C AAAAT C T T G T C C AAG AAC T CAAGT CAC T T T AC AT C TAT T AAC T G C T T T GGAG AC T T C AT AA 

III TTTTTCTAGGAAAGGTGTTAGTGGTGTGTTTCACTGTTTTTGGAGGACTCATGGCTTTTAACTAC 
AATCGGGCATTCCAGGTGTGGGCAGTCCCTCTGTTATTGGTAGCTTTTTTTGCCTACTTAGTAGC 
CCATAGTTTTTTATCTGTGTTTGAAACTGTGCTGGATGCACTTTTCCTGTGTTTTGCTGTTGATC 
TGGAAACAAATGATGGATCGTCAGAAAAGCCC TACT TTATGGATCAAGAATT T C TGAGTT TCGTA 
AAAAGGAGCAACAAATTAAACAATGCAAGGGCACAGCAGGACAAGCACTCATTAAGGAATGAGGA 
GGGAACAGAACTCCAGGCCAT TGTGAGATAGATACCCAT T T AGGTATC T GTACC TGGAAAACAT T 
TCCT TCTAAGAGCCAT T TACAGAATAGAAGATGAGACCAC T AGAGAAAAGTTAGTGAAT T T T T T T 
T TAAAAGACCTAATAAACCC TAT TC T TCCTCAAAA 



APPJD=1 0063569 



Page 183 of 320 



FIGURE 32 

MSGRDT I LGLC I LALALSLAMMFT FRFI T TLLVH I F I SLVILGLL FVCGVLWWL YYDYTNDLS IE 
LpTERENMKCVLGFAIVSTGITAVLLVLIFVLRKRIKLTVELFQITNKAISSAPFLLFQPLWTFA 
ILIFFWVLWVAVLLSLGTAGAAQVMEGGQVEYKPLSGIRYMWSYHLIGLIWTSEFILACQQMTIA 
GAWTCYFNRSKNDPPDHPILSSLS I LFFYHQGTWKGS FLI S WRI PRI IVMYMQNALKEQQHG 
ALSRYLFRCCYCCFWCLDKYLLHLNQNAYTTTAINGTDFCTSAKDAFKILSKNSSHFTSINCFGD 
FI I FLGKVLWCFTVFGGLMAFNYNRAFQVWAVPLLLVAFFAYLVAHS FLSVFETVLDALFLCFA 
VDLETNDGSSEKPYFMDQEFLSFVKRSNKLNNARAQQDKHSLRNEEGTELQAIVR 

Important features: 
Signal peptide: 
amino acids 1-20 

Putative transmembrane domains: 

amino acids 35-54, 75-97, 126-146, 185-204, 333-350, 352-371 

N-glycosylation sites* 

amino acids 204-208, 295-299, 313-317 

N-myristoylation sites. 

amino acids 147-153, 178-184, 196-202, 296-275, 342-348 
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FIGURE 3 3 



GTTCGATTAGCTCCTCTGAGAAGAAGAGAAAAGGTTCTTGGACCTCTCCCTGTTTCTTCCTTAGA 
ATAATT T GTATGGGAT T T GT GATGCAGGAAAGCC TAAGGGAAAAAGAATATT CAT TCTGTGTGGT 
GAAAAT T T T T TGAAAAAAAAAT T GCC T T CT T CAAACAAGGGT GTCAT T C T GATAT T TATGAGGAC 
TGTTGTTCTCAGTATGAAGGCATCTGTTATTGAAATGTTCCTTGTTTTGCTGGTGACTGGAGTAC 
AT T CAAACAAAGAAAC GGCAAAGAAGAT TAAAAGGCCCAAGT TCACTGT GCCTCAGATCAACTGC 
GATGTCAAAGCC GGAAAGAT CAT CGAT CCT GAGT TCAT T GT GAAATGT CCAGCAGGATGCCAAGA 
CCCCAAATACCATGTTTATGGCACTGACGTGTATGCATCCTACTCCAGTGTGTGTGGCGCTGCCG 
TACACAGTGGTGTGCTTGATAATTCAGGAGGGAAAATACTTGTTCGGAAGGTTGCTGGACAGTCT 
GGTTACAAAGGGAGTTATTCCAACGGTGTCCAATCGTTATCCCTACCACGATGGAGAGAATCCTT 
TATCGT CTTAGAAAGT AAACCCAAAAAGGGTGTAACC TACCCATCAGCT CT TACATACT CAT CAT 
N a CGAAAAGTCCAGC T GCCCAAGCAGGTGAGACCACAAAAGCC TATCAGAGGCCACC TATT CCAGGG 

|3 ACAACTGCACAGCCGGTCACTCTGATGCAGCTTCTGGCTGTCACTGTAGCTGTGGCCACCCCCAC 
n CACCTTGCCAAGGCCA.TCCCCTTCTGCTGCTTCTACCACCAGCATCCCC7^GACCACAATC7U;TG^ 
GCCACAGGAGCCAGGAGATGGAT C TC TGGTCCACT GCCACCTACACAAGCAGCCAAAACAGGCCC 
J"! AGAGCT GATCCAGGTATCCAAAGGCAAGATCC T T CAGGAGCTGCC T TCCAGAAACC TGT T GGAGC 

GGATGT CAGCCT GGGACTTGTTCCAAAAGAAGAAT TGAGCACACAGTCT T T GGAGC CAGT AT CCC 
111 TGGGAGATCCAAAC TGCAAAAT TGACTT GTCGT T T TTAATT GATGGGAGCACCAGCATTGGCAAA 

111 CGGCGATTCCGAATCCAGAAGCAGCTCCTGGCTGATGTTGCCCAAGCTCTTGACATTGGCCCTGC 
*f\ CGGTCCACTGATGGGTGTTGTCCAGTATGGAGACAACCCTGCTACTCACTTTAACCTCAAGACAC 
ACACGAATTCTCGAGATCTGAAGACAGCCATAGAGAAAATTACTCAGAGAGGAGGACTTTCTAAT 

* GTAGGT CGGGCCATCT CCTT TGT GACCAAGAACT TCT T T TC CAAAGCCAAT GGAAACAGAAGCGG 

13 GGCTCCC^TGTGGTGGTGGTGATGGTGGATGGCTGGCCCACGGACAAAGTGGAGGAGGCTTCAA 
IB GAC T TGCGAGAGAGTCAGGAATCAACAT TT TC TTCAT CACCAT TGAAGGT GCT GC TGAAAAT GAG 

f*i AAGCAG TATGTGGT GGAGCCCAAC T T T GCAAACAAGGCCGT GTGCAGAACAAACGGC TT CTACTC 

Jff GCTCCACGTGCAGAGCTGGTTTGGCCTCCACAAGACCCTGCAGCCTCTGGTGAAGCGGGTCTGCG 
W ACACTGACCGCCTGGCCTGCAGCAAGACCTGCTTGAACTCGGCTGACATTGGCTTCGTCATCGAC 
CJ GGCTCCAGCAGTGTGGGGACGGGCAACTTCCGCACCGTCCTCCAGTTTGTGACCAACCTCACCAA 
fff AGAGTT T GAGAT T TCCGACACGGACACGCGCATCGGGGCCGT GCAGTACACCTACGAACAGCGGC 

TGGAGTTTGGGTTCGACAAGTACAGCAGCAAGCCTGACATCCTCAACGCCATCAAGAGGGTGGGC 
TACTGGAGTGGTGGCACCAGCACGGGGGCTGCCATCAACTTCGCCCTGGAGCAGCTCTTCAAGAA 
GTCCAAGCCCAACAAGAGGAAGT TAAT GAT CC TCATCACCGACGGGAGGT CCTACGACGACGT CC 
GGATCCCAGCCATGGCTGCCCATCTGAAGGGAGTGATCACCTATGCGATAGGCGTTGCCTGGGCT 
GCCCAAGAGGAGC TAGAAGT CAT T GCCACTCACCCCGCCAGAGACCACT CCT TC T T TGTGGAC GA 
GT T TGACAACCTCCAT CAGTATGTCCCCAGGATCATCCAGAACAT T TGTACAGAGT TCAACT CAC 
AGC C T C G GAACTGAAT T C AGAGCAGGCAGAG CACCAGCAAG TGCTGCTT T AC TAAC T GAC G T G T T 
GGACCACCCCACCGCTTAATGGGGCACGCACGGTGCATCAAGTCTTGGGCAGGGCATGGAGAAAC 
AAATGT CT T GTTAT TAT TCT T TGCCAT CAT GC TT T T TCATAT TCCAAAACTT GGAGT TACAAAGA 
TGATCACAAACGTATAGAATGAGCCAAAAGGCTACATCATGT TGAGGGTGCTGGAGATTT TACAT 
TT TGACAATTGT TT TCAAAATAAATGTTCGGAATACAGTGCAGCCCTTACGACAGGCTTACGTAG 
AGCTTTTGTGAGATTTTTAAGTTGTTATTTCTGATTTGAACTCTGTAACCCTCAGCAAGTTTCAT 
T T T TGT CAT GACAATGTAGGAAT TGC T GAAT TAAATGT TT AGAAGGATGAAAAATAAAAAAAAAA 
AAAAAAAAAA?\AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAG 
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FIGURE 34 

MRTWLTMKASVIEMFLVLLVTGVHSNKETAKKIKRPKFTVPQINCDVKAGKIIDPEFIVKCPAG 
CQDPKYHVYGTDVYASYSSVCGAAVHSGVLDNSGGKILVRKVAGQSGYKGSYSNGVQSLSLPRWR 
E S FIVLESKPKKGVT YPSALTYS S SKS PAAQAGE TTKAYQRPP I PGTTAQPVTLMQLLAVTVAVA 
TPTTLPRPSPSAASTTSIPRPQSVGHRSQEMDLWSTATYTSSQNRPRADPGIQRQDPSGAAFQKP 
VGADVSLGLVPKEELSTQSLEPVSLGDPNCKIDLSFLIDGSTSIGKRRFRIQKQLLADVAQALDI 
GPAGPLMGWQYGDNPATHFNLKTHTNSRDLKTAIEKI TQRGGLSNVGRAI S FVTKNFFSKANGN 
RSGAPNWVVMTOGWPTDKTCEASRLARE 

FYSLHVQSWFGLHKTLQPLVKRVCDTDRLACSKTCLNSADIGFVIDGSSSVGTGNFRTVLQFVTN 
LTKEFEISDTDTRIGAVQYTYEQRLEFGFDKYSSKPDILNAIKRVGYWSGGTSTGAAINFALEQL 
FKKSKPNKRKLMILI TDGRSYDDVRI PAL1AAHLKGVITYAIGVAWAAQEELEVIATHPARDHS FF 
VDEFDNLHQYVPRIIQNICTEFNSQPRN 

Important features: 
Signal peptide: 

amino acids 1-26 

Transmembrane domain: 

amino acids 181-200 

N-glycosylation sites. 

amino acids 390-394, 520-524 

N-myristoylation sites. 

amino acids 23-29, 93-99, 115-121, 262-268, 367-373, 389-395, 
431-437, 466-472, 509-515, 570-576, 571-577, 575-581, 627-633 

Amidation site. 

amino acids 304-308 
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FIGURE 35 



CCGAGC AC AGGAGATT GCCTGC GT TT AGGAGGTGGC TGCGTT GTGGGAAAAGCT AT C AAGGAAGAAATTGC 
CAAACC ATGT CT T T T TTT CTGT TT TC AG AGTAGT TC AC AACAGAT CTG AGTGTTTT AATT AAGCATGGAAT 
ACAG AAAACAAC AAAAAACT T AAGCTTT AATTTCAT CTGGAATTCCAC AGTT TT CT T AGCTCCCTGGACCC 
GGTTGACCTGTTGGCTCTTCCCGCTGGCTGCTCTATCACGTGGTGCTCTCCGACTACTCACCCCGAGTGTA 
AAGAACCTTCGGCTCGCGTGCTTCTGAGCTGCTGTGG ATGG CCTCGGCTCTCTGGACTGTCCTTCCGAGTA 
GGATGTCACTGAGATCCCTCAAATGGAGCCTCCTGCTGCTGTCACTCCTGAGTTTCTTTGTGATGTGGTAC 
CT CAGCCT T C CCCACT AC AATGTGAT AGAACGCGT GAACT GGATGT ACTTCT AT GAGT AT GAGCCGAT TT A 
§ I CAGACAAGACTTTCACTTCACACTTCGAGAGCATTCAAACTGCTCTCATCAAAATCCATTTCTGGTCATTC 
p! TGGTGACCTCCCACCCTTCAGATGTGAAAGCCAGGCAGGCCATTAGAGTTACTTGGGGTGAAAAAAAGTCT 
Q TGGTGGGGAT AT GAGGTT CT T ACATTTT TCTT ATT AGGCC AAGAGGCT GAAAAGGAAGAC AAAATGTT GGC 

Irt AT TGTCCT TAGAGGAT GAAC ACCT TCTT TATGGTGACAT AAT CCGACAAGAT TT TTT AGACACAT ATAAT A 

W ACCTGACCTTGAAAACCATTATGGCATTCAGGTGGGTAACTGAGTTTTGCCCCAATGCCAAGTACGTAATG 

If! 

^ 2 AAGACAGACACTGATGTTTTCATCAATACTGGCAATTTAGTGAAGTATCTTTTAAACCTAAACCACTCAGA 

iff! 

GAAGTT TT TC AC AGGT T ATCCTCT AATTGAT AATTATT CC T AT AGAGGATT TT ACC AAAAAACCCATAT T T 

*p 

CTTACCAGGAGTATCCTTTCAAGGTGTTCCCTCCATACTGCAGTGGGTTGGGTTATATAATGTCCAGAGAT 
TTGGTGCC AAGG AT CT AT GAAATGAT GGGT CACGTAAAACCC AT C AAGTTT GAAGATGTT T ATGTCGGGAT 
||| CTGTTTGAAT TT ATT AAAAGTGAACATTCATATT CC AGAAGAC ACAAATCTT TTCTTT CT AT AT AGAAT CC 

p| ATTTGGATGTCTGTCAACTGAGACGTGTGATTGCAGCCCATGGCTTTTCTTCCAAGGAGATCATCACTTTT 
flj TG GCAGGTCAT GCTAAGG AAC ACC AC AT GCCATT AT TAA CTTCAC ATT CT AC AAAAAGCC TAGAAGGACAG 

CI GATACCTT GTGG AAAGTGTT AAAT AAAGTAGGT ACT GT GGAAAAT T CATGGGGAGGTC AGTGTGCT GGCT T 

III AC AC TGAACT GAAACT CATGAAAAACCC AGACTGGAGACT GGAGGGT T AC ACTT GTGATT T ATT AGTCAGG 

CC CT TCAAAGATGATATGTGGAGGAATT AAAT AT AAAGGAAT TGGAGGTT TT TGCT AAAGAAAT T AAT AGG 
ACCAAACAATTTGGACATGTCATTCTGTAGACTAGAATTTCTTAAAAGGGTGTTACTGAGTTATAAGCTCA 
CT AGGCTGT AAAAACAAAAC AATGTAGAGTTT T ATT T AT T GAACAAT GT AGTCACTTGAAGGT T TTGTGT A 
T ATCT T AT GT GGAT T ACCAATT TAAAAAT AT ATGT AGTT C T GTGTC AAAAAACT TCT TCACTGAAGTT AT A 
CTGAACAAAATTTTACCTGTTTTTGGTCATTTATAAAGTACTTCAAGATGTTGCAGTATTTCACAGTTATT 
AT T ATTT AAAATTACT T C AACTT TGTGT T T TT AAATGTTT T GACGATT T C AAT ACAAGAT AAAAAGGAT AG 
TGAATC AT TCTTTACAT GCAAAC ATT TT CC AGTTAC TT AACT GAT C AGT TTATT AT T GAT ACAT CACTCCA 
TT AATGTAAAGTC AT AGGTCATT ATT GC AT ATCAGT AATCT C TT GGAC T T TGTT AAAT AT TTT ACT GT GGT 
AAT AT AGAGAAG AATT AAAGCAAG AAAATCT GAAAA 
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FIGURE 36 

MASM,WTVLPSRMSLRSLKWS^ 

TLREHSNCSHQNPFLVILVTSHPSDVKARQAIRVTWGEKKSWWGYEVLTFFLLGQEAEKEDKMLA 
LSLEDEHLLYGDIIRQDFLDTYNNLTLKTIMAFRWVTEFCPNAKYVMKTDTDVFINTGNLVKYLL 
NLNHSEKFFTGYPLIDNYSYRGFYQKTHISYQEYPFKVFPPYCSGLGYIMSRDLVPRIYEMMGHV 
KP I KFEDVYVG I CLNLLKVNI HI PEDTNL FFLYRI HLDVCQLRRVT AAHGFS SKE 1 1 T FWQVMLR 
NTTCHY 

Important features: 

Type II transmembrane domain: 

amino acids 20-39 

N-glycosylation sites. 

amino acids 72-76, 154-158, 198-202, 212-216, 326-330 

Glycosaminoglycan attachment site. 

amino acids 239-243 

Ly-6 / u-PAR domain proteins. 

amino acids 23-37 

N-myristoylation site. 

amino acids 271-277 
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FIGURE 37 



CGCTCGGGCACCAGCCGCGGCAAG GATGG AGCTGGGTTGCTGGACGCAGTTGGGGCTCACTTTTCTTCAGCTCCTTCTCATC 
T CGTCCTTGCCAAGAGAGTACACAGTCAT TAATGAAGCCTGCCC T GGAGCAGAGTGGAATATCATGTGTCGGGAGT GC TGT G 
AATATGATCAGATTGAGTGC GTC TGCCCCGGAAAGAGGGAAGT C GTGGGTT ATACCATCC CT TGCTGC AGGAATGAGGAGAA 
TGAGTGTGACTCCTGCCTGATCCACCCAGGTTGTACCATCTTTGAAAACTGCAAGAGCTGCCGAAATGGCTCATGGGGGGGT 
ACC T TGGAT GACT TC TATGTGAAGGGGTTCT ACTGTGCAGAGTGCCGAGCAGGCTGGTACGGAGGAGACTGCATGC GAT GT G 
GCC AGGT T C TGCGAGCCCCAAAGGGTCAGAT T T TGT TGGAAAGC TATC CCC TAAATGCTCAC TGTGAATGGACCAT TCAT GC 
TAAACCTGGGTT TGTC ATCCAACTAAGAT TT GTCATGTTGAGTC TGGAGTT T GAC T AC ATGTGC C AGT ATGAC TAT G T T GAG 
GTTCGTGATGGAGACAACCGCGATGGCCAGATCATCAAGCGTGTCTGTGGCAACGAGCGGCCAGCTCCTATCCAGAGCATAG 
GATCC TCACTCCACGT CC TC T TCC ACT CCGATGGCTCCAAGAAT TTTGACGGT TTCC ATGCCAT T TATGAGGAGAT CACAGC 
ATGC TCC TCATCCCC T TGTTTCCATGACGGCACGT GC GTCC TTGACAAGGC TGGATC TTACAAGT GTGCCT GCTTGGCAGGC 
TATAC TGGGCAGCGC T GTGAAAATC TCC T TGAAGAAAGAAAC TGCTCAGACCC TGGGGGCCCAGTCAATGGGT ACC AGAAAA 
TAACAGGGGGCCCTGGGCTTATCAACGGACGCCATGCTAAAATTGGCACCGTGGTGTCTTTCTTTTGTAACAACTCCTATGT 
| k | T C TTAGTGGCAATGAGAAAAGAACT TGCCAGCAGAATGGAGAGTGGTCAGGGAAACAGCCCATC TGCATAAAAGC C T GCCGA 

5ff GAAC CAAAGATT TCAGACCTGGTGAGAAGGAGAGT T C TTCC GAT GCAGGTTC AGTCAAGGGAGAC ACCATTAC ACCAGC TAT 

; br 5 

tf\ ACTCAGCGGCC T TCAGCAAGC AGAAAC TGCAGAGTGCCCCTACCAAGAAGCCAGCCC TTCCC T T TGGAGATC T GCCCATGGG 

ATACCAACATC T GCATACCCAGC TC CAGTATGAGTGC ATC TCACCCTT C TACCGCCGCC TGGGCAGCAGCAGGAGGACATGT 
C TGAGGAC TGGGAAGTGGAGTGGGC GGGCACCATC C TGCATCCC TATC TGC GGGAAAATTGAGAACATCAC T GCTCCAAAGA 
CCCAAGGGTTGCGCTGGCCGTGGCAGGCAGCCATCTACAGGAGGACCAGCGGGGTGCATGACGGCAGCCTACACAAGGGAGC 
GTGGTTCCTAGTCTGCAGCGGTGCCCTGGTGAATGAGCGCACTGTGGTGGTGGCTGCCCACTGTGTTACTGACCTGGGGAAG 
GTCACCATGATCAAGACAGC AGACC TGAAAGT TGTT T TGGGGAAATTC TAC CGGGAT GATGACCGGGAT GAGAAGACC ATCC 
AGAGCCTACAGAT TT C TGC TATCAT TCTGCATCCCAACTATGACCCC ATCC TGCT T GATGC TGACATCGCCAT CCT GAAGC T 
CCTAGACAAGGCCCGT ATCAGCACCCGAGTCCAGCCCATCTGCC TCGCTGCCAGT C GGGATC TC AGCACTTC CTTC C AGGAG 
T CCCACATCACT GTGGCTGGC TGGAATGTCCT GGCAGACGTGAGGAGCCCT GGCT T C AAGAACGACACACT GCGC TC TGGGG 
T GGTCAGTGTGGTGGACTCGC TGCTGTGTGAGGAGCAGCAT GAGGACCATGGC ATC CCAGT GAGTGTCACT GATAAC ATGT T 
C TGT GCCAGCTGGGAACCCAC TGCCCC TTCTGATAT C TGCAC TGCAGAGACAGGAGGCAT C GCGGCTGTGT C CT TCC CGGGA 
C GAGC ATC TCC TGAGC CAC GC TGGCATCTGATGGGACTGGT CAGCTGGAGC TATGATAAAACATGCAGCCAC AGGC TCTCCA 
C TGCC TT CACCAAGGT GC T GC CT TT TAAAGAC TGGATTGAAA,GAAATATGAAATGAACCAT GC TCATGCAC TCCT TGAGARG 
TGTTTCTGTATATCCGTCTGTACGTGTGTCATTGCGTGAAGCAGTGTGGGCCTGAAGTGTGATTTGGCCTGTGAACTTGGCT 
GTGCC AGGGCT T C TGAC TTCAGGGACAAAACT C AGTGAAGGGTGAGTAGACC TCC AT TGCT GGTAGGC TGAT GCC GC GTCCA 
C TAC TAGGACAGCCAAT TGGAAGAT GCCAGGGC TTGC AAGAAGTAAGT T TC T TCAAAGAAGACCATATACAAAAC C T C TCCA 
CTCCACTGACCTGGTGGTCTTCCCCAACTTTCAGTTATACGAATGCCATCAGCTTGACCAGGGAAGATCTGGGCTTCATGAG 
GCCCCTTTTGAGGCTCTCAAGTTCTAGAGAGCTGCCTGTGGGACAGCCCAGGGCAGCAGAGCTGGGATGTGGTGCATGCCTT 
TGTGT ACATGGC C ACAGTACAGTCT GGTCCTT T TC CT TCC C CATCTC T TGTACAC AT T T TAAT AAAATAAGGGTT GGCT TC T 
GAAC TAC AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 

AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 38 

MELGCWTQLGLTFLQLLLISSLPREYTVINEACPGAEWNIMCRECCEYDQIECVCPGKREWGYT 
IPCCRNEENECDSCLIHPGCTIFENCKSCRNGSWGGTLDDFYVKGFYCAECRAGWYGGDCMRCGQ 
VLRAPKGQILLESYPLNAHCEWTIHAKPGFVIQLRFVMLSLEFDYMCQYDYVEVRDGDNRDGQII 
KRVCGNERPAP I QS IGS SLHVLFHSDGSKNFDGFHAI YEE I TACS S S PC FHDGTCVLDKAGS YKC 
ACLAGYTGQRCENLLEERNCSDPGGPWGYQKITGGPGLINGRHAKIGTWSFFCNNSYVLSGNE 
KRTCQQNGEWSGKQPICIKACREPKISDLVRRRVLPMQVQSRETPLHQLYSAAFSKQKLQSAPTK 
KPALPFGDLPMGYQHLHTQLQYECISPFYRRLGSSRRTCLRTGPCWSGRAPSCIPICGKIENITAP 

fa 

KTQGLRWPWQAAIYRRTSGVHDGSLHKGAWFLVCSGALVNERTVWAAHCVTDLGKVTMIKTADL 
P KWLGKFYRDDDRDEKT I QSLQI S AI I LHPNYDP I LLDADI AILKLLDKARI S TRVQPI CLAASR 

CP DLSTSFQESHITVAGWNVLADVRSPGFKNDTLRSGWSWDSLLCEEQHEDHGIPVSVTDNMFCA 

$ .. 5 

SWEPTAPSDICTAETGGIAAVSFPGRASPEPRWHLMGLVSWSYDKTCSHRLSTAFTKVLPFKDWI 
%l ERNMK 

Important features of the protein: 
Signal peptide : 

amino acids 1-23 

EGF-like domain cysteine pattern signature. 

amino acids 260-272 
N-glycosylation sites. 

amino acids 96-100, 279-283, 316-320, 451-455, 614-618 
N-myristoylation sites. 

amino acids 35-41, 97-103, 256-262, 284-290, 298-304, 308-314, 
474-480, 491-497, 638-644, 666-672 
Amidation site, 
amino acids 56-60 

Serine proteases, trypsin family. 

amino acids 48 9-506 
CUB domain proteins profile. 

amino acids 150-167 



•J! 5?^ 
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FIGURE 3Q 



GGTT CCTACAT CCT CT CATCTGAGAAT CAGAGAGCATAAT CT TCTTACG GGCCCGT GAT TTATTAACGTGGCTT AAT C 
TGAAGGTTCTCAGTCAAATTCTTTGTGATCTACTGATTGTGGGGGCATGGCAAGGTTTGCTTAAAGGAGCTTGGCTGG 
TTTGGGCCCTTGTAGCTGACAGAAGGTGGCCAGGGAGAATGCAGC7VCACTGCTCGGAGAATGAAGGCGCTTCTGTTGC 
T GGT CT T GC CTT GGCTCAGT CCTGCTAACTACATT GACAAT GTGGGCAACCT GCACTTCCTGTATTCAGAACTCTGTA 
AAGGT GCCT CCCACTAC GGCCTGACCAAAGATAGGAAGAGGCGCT CACAAGAT GGCTGT C CAGACGGCT GT GCGAGCC 
TCACAGCCACGGCTCCCTCCCCAGAGGTTTCTGCAGCTGCCACCATCTCCTTAATGACAGACGAGCCTGGCCTAGACA 
ACCCT GC CTAC GTGT CCTCGGCAGAGGAC GGGCAGCCAGCAAT CAGCCCAGT GGACT CT GGC CGGAGCAACCGAACTA 
GGGCAC GGCCCT TT GAGAGAT CCACTATTAGAAGCAGAT CAT TTAAAAAAATAAAT CGAGCTTT GAGTGTT CTT CGAA 
GGACAAAGAGCGGGAGTGCAGTTGCCAAC CAT GCCGACCAGGGCAGGGAAAATT CTGAAAACACCACT GCCCCT GAAG 
TC TTT CCAAGGTTGTACCACCT GATT C CAGATGGT GAAATTACCAGCAT CAAGAT CAATC GAGTAGATC CCAGT GAAA 
GCCT CT CT AT TAG GCT GGT GGGAGGTAGCGAAACCC CACT GGT C CATAT CATTAT CCAACACATTTAT CGTGAT GGGG 
T GAT C GCCAGAGACGGC CGGCTACT GCCAGGAGACAT CAT TCTAAAGGTCAAC GGGAT G GACAT CAG CAAT GT C CCT C 
ACAACTACGCTGTGCGTCTCCTGCGGCAGCCCTGCCAGGTGCTGTGGCTGACTGTGATGCGTGAACAGAAGTTCCGCA 
GCAGGAACAATGGACAGGCC CC GGATGCCTACAGACCC CGAGAT GACAGCTTT CATGT GATT CTCAACAAAAGTAGCC 
CCGAGGAGCAGCTTGGAATAAAACTGGTGCGCAAGGTGGATGAGCCTGGGGTTTTCATCTTCAATGTGCTGGATGGCG 
GT GT GGCATATCGACAT GGTCAGCTTGAG GAGAATGACCGTGT GTTAGCCAT CAATGGACATGAT CTT C GATAT GGCA 
GCCCAGAAAGTGCGGCTCATCTGATTCAGGCCAGTGAAAGACGTGTTCACCTCGTCGTGTCCCGCCAGGTTCGGCAGC 
GGAGC CCTGACAT CTT T CAGGAAGC CGGCT GGAACAGCAATGG CAGCT GGTC CCCAGGGC CAGGGGAGAGGAG CAACA 
CT CC CAAGCCCCT CCAT CCTACAATTACT T GT CATGAGAAGGT GGTAAATAT CCAAAAAGACC CC GGT GAATCTCTC G 
GCAT GACC GTCGCAGGGGGAGCAT CACATAGAGAATGGGATTT GCCTAT CTAT GT CAT CAGT GT TGAGCCCGGAGGAG 
TCATAAGCAGAGAT GGAAGAAT AAAAACAGGT GACAT TTTGTT GAAT GT G GAT GGGGTCGAACT GACAGAGGTCAGC C 
GGAGT GAGGCAGT GGC^VTTATT GAAAAGAACATCATCCT CGATAGTACTCAAAGCTTT GGAAGT CAAAGAGTAT GAGC 
CC CAGGAAGACT GCAGCAGCCCAGCAGCCCTGGACTCCAACCACAACAT GGCCC CACC CAGT GACTGGTCCCCAT CCT 
GG GTCAT GT GGCT GGAATTACCACGGT GCTTGTATAACT GTAAAGATATT GT ATTAC GAAGAAACACAGCT GGAAGTC 
TGGGCTTCT GCATT GTAGGAGGT TATGAAGAATACAAT GGAAACAAACCTTTTT T CAT CAAAT C CATT GTT GAAGGAA 
CACCAGCATACAAT GATGGAAGAATTAGAT GT GGTGATATTCTT CTT GCT GT CAATGGTAGAAGTACATCAGGAATGA 
TACAT GCTT GCTT GGCAAGACT GCT GAAAGAACTTAAAGGAAGAATTACT CTAACTATTGTTTCTTGGCCT GGCACTT 
TTTTATAGAATCAAT GAT GG GT CAGAGGAAAACAGAAAAAT CACAAATAGGC TAAGAAGT T GAAACACT ATATTTAT C 
TTGTCAGTTTTTATATTTAAAGAAAGAATACATTGTAAAAATGTCAGGAAAAGTATGATCATCTAATGAAAGCCAGTT 
ACACCTCAGAAAATATGATT CCAAAAAAATTAAAACTACTAGT TT TT TTTCAGT GTGGAGGAT TT CT CATTACTCTAC 
AACATTGTTTATATTTTTTCTATTCAATAAAAAGCCCTAAAACAACTAAAATGATTGATTTGTATACCCCACTGAATT 
CAAGCT GAT TTAAATTTAAAATTTGGTATATGCTGAAGTCT GCCAAGGGTACATTAT GGC CATTTT TAATTT ACAGCT 
AAAAT AT TTTTTAAAATGCATT GCT GAGAAACGTT GCT TT CAT CAAACAAGAATAAATATTT TTCAGAAGTTAAA 



FIGURE 4Q 

MKALLLLVLPWLSPANYIDNVGNLHFLYSELCKGASHYGLTKDRKRRSQDGCPDGCASLTATAPS 
PE VS AAAT I SLMTDEPGLDNPAYVS SAEDGQPAI S PVDS GRSNRTRARP FERST IRSRS FKKINR 
ALSVLRRTKSGSAVANHADQGRENSENTTAPEVFPRLYHLIPDGEITSIKINRVDPSESLSIRLV 
GGSETPLVHIIIQHIYRDGVIARDGRLLPGDIILKVNGMDISNVPHNYAVRLLRQPCQVLWLTVM 
REQKFRSRNNGQAPDAYRPRDDS FHVI LNKSS PEEQLGI KLVRKVDEPGVFI FNVLDGGVAYRHG 
QLEENDRVLAINGHDLRYGS PE SAAHL I QASERRVHLWSRQVRQRS PD I FQEAGWNSNGS WS PG 
| 4 , PGERSNTPKPLHPTITCHEKWNIQKDPGESLGMTVAGGASHREWDLPIYVISVEPGGVISRDGR 
Q IKTGDILLNVDGVELTEVSRSEAVALLKRTSSSIVLKALEVKEYEPQEDCSSPAALDSNHNMAPP 
13 SDWSPSWVMWLELPRCLYNCKDIVLRRNTAGSLGFCIVGGYEEYNGNKPFFIKSIVEGTPAYNDG 

W * RIRCGDILLAVNGRSTSGMIHACLARLLKELKGRITLTIVSWPGTFL 

I s i 

sfs 
3 

||| Important features: 

; HJ Signal peptide: 

i? amino acids 1-15 

^ N-glycosylation sites. 

Hi amino acids 108-112, 157-161, 289-293, 384-388 

S 

H? Tyrosine kinase phosphorylation sites. 

amino acids 433-441, 492-500 



N-myristoylation sites. 

amino acids 51-57, 141-147, 233-239, 344-350, 423-429, 447-453, 
467-473, 603-609 
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FIGURE 4.1 

ACCAGGCATTGTATCTTCAGTTGTCATCAAGTTCGCAATCAGATTGGAAAAGCTCAACTTGAAGCTTT 
CTTGCCTGCAGTGAAGCAGAGAGATAGATATTATTCACGTAATAAAAAACATGGGCTTCAACCTGACT 
TTCCACCTTTCCTACAAATTCCGATTACTGTTGCTGTTGACTTTGTGCCTGACAGTGGTTGGGTGGGC 
CACCAGTAACTACTT CGTGGGTGCCAT T CAAGAGAT TCCTAAAGCAAAGGAGTTCAT GGCTAATTTCC 
ATAAGACCCTCATTT TGGGGAAGGGAAAAACT CTGACTAATGAAGCATCCACGAAGAAGGTAGAACTT 
GACAACTGTCCTTCT GTGT CT CCT TACCTCAGAGGCCAGAGCAAGCT CATT TT CAAACCAGATCTCAC 
TTTGGAAGAGGTACAGGCAGAAAATCCCAAAGTGTCCAGAGGCCGGTATCGCCCTCAGGAATGTAAAG 
^ CTTTACAGAGGGTCGCCATCCTCGTTCCCCACCGGAACAGAGAGAAACACCTGATGTACCTGCTGGAA 
PI CATCTGCATCCCTTCCTGCAGAGGCAGCAGCTGGATTATGGCATCTACGTCATCCACCAGGCTGAAGG 
m TAAAAAGTTTAATCGAGCCAAACTCTTGAATGTGGGCTATCTAGAAGCCCTCAAGGAAGAAAATTGGG 

5j{i a 

m ACTGCTTTATATTCCACGATGTGGACCTGGTACCCGAGAATGACTTTAACCTTTACAAGTGTGAGGAG 
If] CATCCCAAGCATCTGGT GGTT GGCAGGAACAGCACT GGGTACAGGTTACGT TACAGT GGAT ATTTTGG 

lp GGGTGTTACTGCCCTAAGCAGAGAGCAGTTTTTCAAGGTGAATGGATTCTCTAACAACTACTGGGGAT 
U ^ GGGGAGGC GAAGACGAT GACCT CAGAC T CAGGGT T GAGCT CCAAAGAAT GAAAAT TT CCC G GC C CCT G 

*L CCTGAAGT GGGTAAATATACAAT GGTCTTCCACACTAGAGACAAAGGCAAT GAGGTGAACGCAGAACG 

lt% GATGAAGCTCTTACACCAAGT GTCACGAGTCT GGAGAACAGATGGGT TGAGTAGTTGTTCT TATAAAT 

p TAGTATCT GTGGAACACAATCCT TT ATATAT CAACAT CACAGTGGATT TCT GGTTTGGTGCATGACCC 

f|| TGGATCTTTTGGTGATGTTTGGAAGAACTGATTCTTTGTTTGCAATAATTTTGGCCTAGAGACTTCAA 
CJ ATAGTAGCACACATTAAGAACCTGTTACAGCTCATTGTTGAGCTGAATTTTTCCTTTTTGTATTTTCT 
W TAGCAGAGCTCCT GGTGATGTAGAGTATAAAACAGTT GTAACAAGACAGCTTTCTTAGTCATTT TGAT 

CATGAGGGTTAAATATTGTAATATGGATACTTGAAGGACTTTATATAAAAGGATGACTCAAAGGATAA 
AATGAACGCTATTTGAGGACT CTGGTTGAAGGAGATTTATT TAAATT T GAAGTAATATAT TATGGGAT 
AAAAGGCCACAGGAAATAAGACT GCTGAATGTCTGAGAGAACCAGAGT TGT TCTCGTCCAAGGTAGAA 
AGGTACGAAGATACAATACTGTTATTCATTTATCCT GTACAATCATC T GTGAAGTGGTGGTGTCAGGT 
GAGAAGGCGTCCACAAAAGAGGGGAGAAAAGGCGACGAATCAGGACACAGT GAACTT GGGAATGAAGA 
GGTAGCAGGAGGGTGGAGTGTCGGCTGCAAAGGCAGCAGTAGCTGAGCTGGTTGCAGGTGCTGATAGC 
CTTCAGGGGAGGACCTGCCCAGGTATGCCTTCCAGT GATGCCCACCAGAGAATACATTCT CTAT TAGT 
TTTTAAAGAGTTTTTGTAAAATGATTTTGTACAAGTAGGATATGAATTAGCAGTTTACAAGTTTACAT 
ATTAACTAATAATAAA.TAT GT CTAT CAAATACCT C T GTAGT AAAATG T GAAAAAG CAAAA 
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FIGURE 42 

MGFNLT FHLS YKFRLLLLLTLCLT WGWAT SNYFVGAI QE I PKAKEFMANFHKTL I LGKGKTLTN 
EASTKKVELDNCPSVSPYLRGQSKLIFKPDLTLEEVQAENPKVSRGRYRPQECKALQRVAILVPH 
RNREKHLMYLLEHLHPFLQRQQLDYGIYVIHQAEGKKFNRAKLLNVGYLEALKEENWDCFIFHDV 
DLVPENDFNLYKCEEHPKHLWGRNSTGYRLRYSGYFGGVTALSREQFFKVNGFSNNYWGWGGED 
DDLRLRVELQRMKISRPLPEVGKYTMVFHTRDKGNEVNAERMKLLHQVSRWRTDGLSSCSYKLV 
SVEHNPLYINITVDFWFGA 

Important features: 
Signal peptide: 

amino acids 1-27 

N-glycosylation sites. 

amino acids 4-8, 220-224, 335-339 

Xylose isomerase proteins. 

amino acids 191-202 
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FIGURE A3 



GCTCAAGACCCAGCAGTGGGACAGCCAGACAGACGGCACGATGGCACTGAGCTCCCAGATCTGGG 
CCGCTTGCCTCCTGCTCCTCCTCCTCCTCGCCAGCCTGACCAGTGGCTCTGTTTTCCCACAACAG 
ACGGGACAACT T GCAGAGCTGCAACCCCAGGACAGAGCT GGAGCCAGGGCCAGC TGGATGCCCAT 
GTTCCAGAGGCGAAGGAGGCGAGACACCCACTTCCCCATCTGCATTTTCTGCTGCGGCTGCTGTC 
ATCGATCAAAGTGTGGGATGTGCTGCAAGACGTAGAACCTACCTGCCCTGCCCCCGTCCCCTCCC 
TTCCTTATTTATTCCTGCTGCCCCAGAACATAGGTCTTGGAATAAAATGGCTGGTTCTTTTGTTT 
TCCAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 44 

MALSSQIWAACLLLLLLLASLTSGS 
C I FCCGCCHRSKCGMCCKT 

Important features: 
Signal peptide: 

amino acids 1-24 

cAMP- and cGMP-dependent protein kinase phosphorylation site. 



U amino acids 58-59 



N-myristoylation site. 

amino acids 44-50 

Prokaryotic membrane lipoprotein lipid attachment site. 

amino acids 1-12 
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FIGURE 45 

GTGGCTTCATTTCAGTGGCTGACTTCCAGAGAGCAATATGGCTGGTTCCCCAACATGCCTCACCC 
TCATCTATATCCTTTGGCAGCTCACAGGGTCAGCAGCCTCTGGACCCGTGAAAGAGCTGGTCGGT 
TCCGTTGGTGGGGCCGTGACTTTCCCCCTGAAGTCCAAAGTAAAGCAAGTTGACTCTATTGTCTG 
GACCT T CAACACAACCCCTCTTGT CACCAT ACAGCCAGAAGGGGGCACTATCATAG TGACCCAAA 
ATCGTAATAGGGAGAGAGTAGACT TCCCAGATGGAGGCTACT CCCTGAAGCT CAGCAAAC TGAAG 
AAGAATGACTCAGGGAT CTAC TAT GTGGGGATATACAGC TCATCACTCCAGCAGCCC T CCACCCA 
GGAGTACGTGCTGCATGTCTACGAGCACCTGTCAAAGCCTAAAGTCACCATGGGTCTGCAGAGCA 
ATAAGAATGGCACC TGT GT GACCAATC T GACATGCTGCATGGAACATGGGGAAGAGGAT GTGATT 
TATACCTGGAAGGCCCTGGGGCAAGCAGCCAATGAGTCCCATAATGGGTCCATCCTCCCCATCTC 
CTGGAGATGGGGAGAAAGT GATATGACC TT CATC T GCGTT GCCAGGAACCCTGTCAGCAGAAAC T 
TCTCAAGCCCCATCCTTGCCAGGAAGCTCTGTGAAGGTGCTGCTGATGACCCAGATTCCTCCATG 
GTCCTCCTGTGTCTCCTGTTGGTGCCCCTCCTGCTCAGTCTCTTTGTACTGGGGCTATTTCTTTG 
GTT TCTGAAGAGAGAGAGACAAGAAGAGTACAT TGAAGAGAAGAAGAGAGTGGACAT TTGTCGGG 
AAACTC CTAACATATGCCCCCAT T C TGGAGAGAACACAGAGTACGACACAATCCCTCACACT AAT 
AGAACAATCCTAAAGGAAGATCCAGCAAATACGGTTTACTCCACTGTGGAAATACCGAAT^AAGAT 
GGAAAATCCCCAC T CACT GCTCACGAT GCCAGACACACCAAGGCTATT T GCCTATGAGAAT GT TA 
TCTAGACAGCAGTGCACTCCCCTAAGTCTCTGCTCA 
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FIGURE 46 



MAGSPTCLTLIYILWQLTGSAASGPVKELVGSVGGAVTFPLKSKVKQVDSIVWTFNTTPLVTIQP 
EGGTIIVTQNRNRERVDFPDGGYSLKLSKLKKNDSGIYYVGIYSSSLQQPSTQEYVLHVYEHLSK 
PKVTMGLQSNKNGTCVTNLTCCMEHGEEDVIYTWKALGQAANESHNGSILPISWRWGESDMTFIC 

VARNPVSRNFS S P I LARKLCEGAADDPDS SMVLLCLLLVPLLL S L FVLGL FLWFLKRERQEE Y IE 
EKKRVDICRETPNICPHSGENTEYDTIPHTNRTILKEDPANTVYSTVEIPKKMENPHSLLTMPDT 

PRLFAYENVI 

Important features: 
Signal peptide: 

amino acids 1-22 

Transmembrane domain: 

amino acids 224-250 

Leucine zipper pattern. 

amino acids 229-251 

N-glycosylation sites. 

amino acids 98-102, 142-146, 148-152, 172-176, 176-180, 204-208, 
291-295 
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FIGURE 47 

GGCT CGAGCGT T TCTGAGCCAGGGGTGACCATGACCT GC T GCGAAGGATGGACATCC T GCAATGG 
ATTCAGCCTGCTGGTTCTACTGCTGTTAGGAGTAGTTCTCAATGCGATACCTCTAATTGTCAGCT 
TAGTTGAGGAAGACCAATTTTCTCAAAACCCCATCTCTTGCTTTGAGTGGTGGTTCCCAGGAATT 
ATAGGAGCAGGTCTGATGGCCAT TCCAGCAACAACAATGTCCT TGACAGCAAGAAAAAGAGCGTG 
C TGCAACAACAGAACT GGAATGT T TC T TT CAT GATT T T T CAGTGTGAT CACAGTCAT TGGT GCTC 
TGTATTGCATGCTGATATCCATCCAGGCTCTCTTAAAAGGTCCTCTCATGTGTAATTCTCCAAGC 
AACAGTAAT GCCAAT TGT GAAT T T TCATTGAAAAACAT CAGTGACAT T CATCCAGAATCCT T CAA 
CTTGCAGTGGTTTTTCAATGACTCTTGTGCACCTCCTACTGGTTTCAATAAACCCACCAGTAACG 
AC ACCATGGCGAGTGGCT GGAGAGCAT CTAGT T TCCAC T TCGAT T CTGAAGAAAACAAACATAGG 
CTTATCCACTTCTCAGTATTTTTAGGTCTATTGCTTGTTGGAATTCTGGAGGTCCTGTTTGGGCT 
CAGTCAGATAGTCATCGGTTTCCTTGGCTGTCTGTGTGGAGTCTCTAAGCGAAGAAGTCAAATTG 
T G TAG T TTAAT GGGAATAAAATGTAAGTAT CAGTAGTT TGAAAAAAAAAAA 
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FIGURE 48 



MTCCEGWTSCNGFSLLVLLLLGWLNAIPLIVSLVEEDQFSQNPISCFEWWFPGIIGAGLMA.IPA 
TTMS LTARKRACCNNRTGMFLS S FFS VI TVI GAL YCML I S I QALLKGPLMCNS PSNSNANCE FS L 
KNISDIHPESFTSTLQWFF^DSCAPPTGFNKPTSNDTMASGWRASSFHFDSEEISIKHRL 
LLVGILEVLFGLSQIVIGFLGCLCGVSfCRRSQIV 

Important features: 
Transmembrane domains : 

amino acids 10-31 (type II), 50-72, 87-110, 191-213 
N-glycosylation sites. 

amino acids 80-84, 132-136, 148-152, 163-167 

cAMP- and cGMP-dependent protein kinase phosphorylation site. 

amino acids 223-227 

N-myristoylation sites. 

amino acids 22-28, 54-60, 83-89, 97-103, 216-222 

Prokaryotic membrane lipoprotein lipid attachment site. 

amino acids 207-218 

TNFR/NGFR family cysteine-rich region protein. 

amino acids 4-12 
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FIGURE 4Q 



ATCCGTTCTCTGCGCTGCCAGCTCAGGTGAGCCCTCGCCAAGGTGACCTCGCAGGACACTGGTGA 
AGGAGCAGT GAGGAACCT GCAGAGTCACACAGT T GCT GACCAAT TGAGCTGT GAGCC TGGAGCAG 
ATCCGTGGGCTGCAGACCCCCGCCCCAGTGCCTCTCCCCCTGCAGCCCTGCCCCTCGAACTGTGA 
CATGGAGAGAGTGACCCTGGCCCTTCTCCTACTGGCAGGCCTGACTGCCTTGGAAGCCAATGACC 
CAT TT GCCAATAAAGACGATCCC T TC T ACTATGAC TGGAAAAACC TGCAGCT GAGCGGACTGATC 
TGCGGAGGGCTCCTGGCCATTGCTGGGATCGCGGCAGTTCTGAGTGGCAAATGCAAATACAAGAG 
CAGCCAGAAGCAGCACAGTCCTGTACCTGAGAAGGCCATCCCACTCATCACTCCAGGCTCTGCCA 
C TAC T T G C TGAG CACAG GAC T GG C CT C C AGG GAT GGC C T GAAGCC T AACAC T GGC C CC CAGCACC 
TCCTCCCCTGGGAGGCCTTATCCTCAAGGAAGGACTTCTCTCCAAGGGCAGGCTGTTAGGCCCCT 
TTCTGATCAGGAGGCTTCTTTATGAATTAAACTCGCCCCACCACCCCCTCA 
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FIGURE so 

MERVTLALLLLAGLTALEANDPFANKDDPFYYDWKNLQLSGLICGGLIAIAGIAAVLSGKCKYKS 
S QKQHS PVPEKAI PL I T PGSATTC 



Important features: 
Signal peptide: 

amino acids 1-16 

13 Transmembrane domain: 

J: amino acids 36-59 

||| N-myristoylation sites. 

|f! amino acids 41-47, 45-51, 84-90 



5 



Extracellular proteins SCP/Tpx-l/Ag5/PR-l/Sc7 . 

amino acids 54-67 
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FIGURE 51 



GTGGAC TCTGAGAAGCCCAGGCAG TT GAGGACAGGAGAGAGAAGGCTGCAGACCCAGAGGGAGGG 

AGGACAGGGAGTCGGAAGGAGGAGGACAGAGGAGGGCACAGAGACGCAGAGCAAGGGCGGCAAGG 

AGGAGACCC T GGTGGGAGGAAGACAC TC TGGAGAGAGAGGGGGC TGGGCAGAGATGAAGT TCCAG 

GGGCCCCTGGCCTGCCTCCTGCTGGCCCTCTGCCTGGGCAGTGGGGAGGCTGGCCCCCTGCAGAG 

CGGAGAGGAAAGCAC T GGGACAAATAT T GGGGAGGCCC TTGGACATGGCCTGGGAGACGCCC TGA 

GCGAAGGGGTGGGAAAGGCCATTGGCAAAGAGGCCGGAGGGGCAGCTGGCTCTAAAGTCAGTGAG 

GCCCTTGGCCAAGGGACCAGAGAAGCAGTTGGCACTGGAGTCAGGCAGGTTCCAGGCTTTGGCGC 

AGCAGATGCTTTGGGCAACAGGGTCGGGGAAGCAGCCCATGCTCTGGGAAACACTGGGCACGAGA 

TTGGCAGACAGGCAGAAGATGTCATTCGACACGGAGCAGATGCTGTCCGCGGCTCCTGGCAGGGG 

GTGCCTGGCCACAGTGGTGCTTGGGAAACTTCTGGAGGCCATGGCATCTTTGGCTCTCAAGGTGG 

CCTTGGAGGCCAGGGCCAGGGCAATCCTGGAGGTCTGGGGACTCCGTGGGTCCACGGATACCCCG 

GAAACTCAGCAGGCAGCTTTGGAATGAATCCTCAGGGAGCTCCCTGGGGTCAAGGAGGCAATGGA 

G GGCCACC AAAC T T T GGGAC CAACAC T CAGG GAGC T G T GGC C CAGC C T GGCT ATG G T T CAGT GAG 

AGCCAGCAACCAGAAT GAAGGGTGCACGAAT CCCCCACCAT CTGGC TCAGGT GGAGGCTCCAGCA 

ACTCTGGGGGAGGCAGCGGCTCACAGTCGGGCAGCAGTGGCAGTGGCAGCAATGGTGACAACAAC 

AATGGCAGCAGCAGTGGTGGCAGCAGCAGTGGCAGCAGCAGTGGCAGCAGCAGTGGCGGCAGCAG 

TGGCGGCAGCAGTGGTGGCAGCAGTGGCAACAGTGGTGGCAGCAGAGGTGACAGCGGCAGTGAGT 

CCTCCTGGGGATCCAGCACCGGCTCCTCCTCCGGCAACCACGGTGGGAGCGGCGGAGGAAATGGA 

CATAAACCCGGGTGTGAAAAGCCAGGGAATGAAGCCCGCGGGAGCGGGGAATCTGGGATTCAGGG 

CT TCAGAGGACAGGGAGTT TCCAGCAACAT GAGGGAAATAAGCAAAGAGGGCAATCGCCT CCT TG 

GAGGCTCTGGAGACAATTATCGGGGGCAAGGGTCGAGCTGGGGCAGTGGAGGAGGTGACGCTGTT 

GGTGGAGTCAATACTGTGAACTCTGAGACGTCTCCTGGGATGTTTAACTTTGACACTTTCTGGAA 

GAATTTTAAATCCAAGCTGGGTTTCATCAACTGGGATGCCATAAACAAGGACCAGAGAAGCTCTC 

GCATCCCGTGACCTCCAGACAAGGAGCCACCAGATTGGATGGGAGCCCCCACACTCCCTCCTTAA 

AACACCACCCTCTCATCACTAATCTCAGCCCTTGCCCTTGAAATAAACCTTAGCTGCCCCACAAA 

AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 

AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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m 



MKFQGPIACLLLALCLGSGEAGPLQSGEESTGTNIGEALGH^ 

EALGQGTREAVGTGVRQVPGFGAADALGNRVGEAAHALGNTGHE I GRQAEDVI RHGADAVRGSWQGVP 
GHSGAWETSGGHGIFGSQGGLGGQGQGNPGGLGTPOTHGYPGNSAGSFGMNPQGAPWGQGGNGGPPNF 
GTNTQGAVAQPGYGSVRASNQNEGCTNPPPSGSGGGSSNSGGGSGSQSGSSGSGSNGDNNNGSSSGGS 
SSGSSSGSSSGGSSGGSSGGSSGNSGGSRGDSGSESSWGSSTGSSSGNHGGSGGGNGHKPGCEKPGNE 
ARGSGESGIQGFRGQGVSSNMREISKEGNRLLGGSGDNYRGQGSSWGSGGGDAVGGWTVNSETSPGM 
FNFDTFWKNFKSKLGFINWDAINKDQRSSRIP 

Signal peptide: 

amino acids 1-21 

N-glycosylation site. 

amino acids 265-2 69 

Glycosaminoglycan attachment site. 

amino acids 235-239, 237-241, 244-248, 255-259, 324-328, 388-392 



|3 Casein kinase II phosphorylation site. 

fll amino acids 26-30, 109-113, 259-263, 300-304, 304-308 



N-myristoylation site. 

amino acids 17-23, 32-38, 42-48, 50-56, 60-66, 61-67, 64-70, 74-80, 

90-96, 96-102, 130-136, 140-146, 149-155, 152-158, 155-161, 

159-165, 163-169, 178-184, 190-196, 194-200, 199-205, 218-224, 

236-242, 238-244, 239-245, 240-246, 245-251, 246-252, 249-252, 

253-259, 256-262, 266-272, 270-276, 271-277, 275-281, 279-285, 

283-289, 284-290, 287-293, 288-294, 291-297, 292-298, 295-301, 

298-304, 305-311, 311-317, 315-321, 319-325, 322-328, 323-329, 

325-331, 343-349, 354-360, 356-362, 374-380, 381-387, 383-389, 
387-393, 389-395, 395-401 

Cell attachment sequence. 

amino acids 301-304 
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o 



GGAGAAGAGGTTGTGTGGGACAAGCTGCTCCCGACAGAAGG ATG TCGCTGCTGAGCCTGCCCTGG 

CTGGGCCTCAGACCGGTGGCAATGTCCCCATGGCTACTCCTGCTGCTGGTTGTGGGCTCCTGGCT 

ACTCGCCCGCATCCTGGCTTGGACCTATGCCTTCTATAACAACTGCCGCCGGCTCCAGTGTTTCC 

CACAGCCCCCAAAACGGAACTGGTTTTGGGGTCACCTGGGCCTGATCACTCCTACAGAGGAGGGC 

TTGAAGGACTCGACCCAGATGTCGGCCACCTATTCCCAGGGCTTTACGGTATGGCTGGGTCCCAT 

CATCCCCTTCATCGTTTTATGCCACCCTGACACCATCCGGTCTATCACCAATGCCTCAGCTGCCA 

TTGCACCCAAGGATAATCTCTTCATCAGGTTCCTGAAGCCCTGGCTGGGAG7VA.GGGATACTGCTG 

AGTGGCGGTGACAAGTGGAGCCGCCACCGTCGGATGCTGACGCCCGCCTTCCATTTCAACATCCT 

GAAGT CCTATATAACGATCTTCAACAAGAGT GCAAACATCATGCT T GACAAGTGGCAGCACC T GG 
tt s - 
1 s 1 CC TCAGAGGGCAGCAGTCGTC TGGACAT GT TTGAGCACAT CAGCCT CAT GAC CTT GGACAGTCTA 

If! CAGAAATGCATCT TCAGCT TTGACAGCCATTGTCAGGAGAGGCCCAGTGAATATAT TGCCACCAT 

CT TGGAGC T CAGT GCCCTT GTAGAGAAAAGAAGCCAGCATAT CC TCCAGCACATGGACT T TCTGT 

ATTACCTCTCCCATGACGGGCGGCGCTTCCACAGGGCCTGCCGCCTGGTGCATGACTTCACAGAC 

GCTGTCATCCGGGAGCGGCGTCGCACCCTCCCCACTCAGGGTATTGATGATTTTTTCAAAGACAA 

If! AGCCAAGT CCAAGACT T TGGAT TTCAT T GAT GTGC TT CTGCT GAGCAAGGAT GAAGATGGGAAGG 

O CATTGTCAGATGAGGATATAAGAGCAGAGGCTGACACCTTCATGTTTGGAGGCCATGACACCACG 

GCCAGTGGCCTCTCCTGGGTCCTGTACAACCTTGCGAGGCACCCAGAATACCAGGAGCGCTGCCG 

ACAGGAGGTGCAAGAGCTTCTGAAGGACCGCGATCCTAAAGAGATTGAATGGGACGACCTGGCCC 

AGCTGCCCTTCCTGACCATGTGCGTGAAGGAGAGCCTGAGGTTACATCCCCCAGCTCCCTTCATC 

TCCCGATGCTGCACCCAGGACATTGTTCTCCCAGATGGCCGAGTCATCCCCAAAGGCATTACCTG 

CCTCATCGATATTATAGGGGTCCATCACAACCCAACTGTGTGGCCGGATCCTGAGGTCTACGACC 

CCTTCCGCTTTGACCCAGAGAACAGCAAGGGGAGGTCACCTCTGGCTTTTATTCCTTTCTCCGCA 

GGGCCCAGGAACTGCATCGGGCAGGCGTTCGCCATGGCGGAGATGAAAGTGGTCCTGGCGTTGAT 

GC TGC T GCAC T T CCGG T T C C T GC CAGAC CACAC T GAGC C CC GCAGGAAG C T G GAAT T GAT CAT GC 

GCGCCGAGGGCGGGCTTTGGCTGCGGGTGGAGCCCCTGAATGTAGGCTTGCAGTGACTTTCTGAC 

CCATCCACCTGTTTTTTTGCAGATTGTCATGAATAAAACGGTGCTGTCAAA 
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FIGURE 5 4 

MSLLSLPWLGLRPVi^SPWLLLLLWGSWL]^ILAWTYAFYI^CRRLQCFPQPPKRNWFWGHLG 
LI TPTEEGLKDS TQMSATYSQGFTWLGPI I PFI VLCHPDT IRS I TNASAAI APKDNLFIRFLKP 
WLGEGILLSGGDKWSRHRRMLTPAFHFNILKSYITIFNKSANIMLDKWQHLASEGSSRLDMFEHI 
SLMTLDSLQKCIFSFDSHCQERPSEYIATILELSALVEKRSQHILQHMDFLYYLSHDGRRFHRAC 
RLVHDFTDAVIRERRRTLPTQGIDDFFKDKAKSKTLDFIDVLLLSKDEDGKALSDEDIRAEADTF 
MFGGHDTTASGLSWVLYNLARHPEYQERCRQEVQELLKDRDPKEIEWDDLAQLPFLTMCVKESLR 
M LHPPAPFISRCCTQDIVLPDGRVIPKGITCLIDIIGVHHNPTVWPDPEVYDPFRFDPENSKGRSP 
LAFI PFSAGPRNC I GQAFAMAEMKVVLALMLLHFRFLPDHTE PRRKLEL IMRAEGGLWLRVE PLN 

%** 

m VGLQ 

111 Important features: 

Transmembrane domains: 

amino acids 13-32 (type II), 77-102 

g 

Iff Cytochrome P450 cysteine heme-iron ligand signature. 

1*1 amino acids 4 61-4 71 

fit 

N-glycosylation sites. 

amino acids 112-116, 168-172 
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FIGURE 5 5 



AT CGCATCAATT GGGAGTACCATCT TCC T CATGGGACCAGTGAAACAGCTGAAGCGAAT GTT T GA 
GCCTACTCGTTTGATTGCAACTATCATGGTGCTGTTGTGTTTTGCACTTACCCTGTGTTCTGCCT 
TTTGGTGGCATAACAAGGGACTTGCACTTATCTTCTGCATTTTGCAGTCTTTGGCATTGACGTGG 
TACAGCCTTTCCTTCATACCATTTGCAAGGGATGCTGTGAAGAAGTGTTTTGCCGTGTGTCTTGC 
ATAATTCAT GGCCAGT TTTATGAAGCTT TGGAAGGCACTATGGACAGAAGCTGGTGGACAGTTTT 
GTAACTATCTTCGAAACCTCTGTCTTACAGACATGTGCCTTTTATCTTGCAGCAATGTGTTGCTT 
GTGATT CGAACAT TTGAGGGT TACTT T T GGAAGCAACAATACAT TC TCGAACCTGAATGT CAGTA 
GCACAGGATGAGAAGTGGGTTCTGTATCTTGTGGAGTGGAATCTTCCTCATGTACCTGTTTCCTC 
TCTGGATGTTGTCCCACTGAATTCCCATGAATACAAACCTATTCAGCAACAGCAAAAAAAAAAAA 
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FIGURE *6 



MGPVKQLKRMFE PTRL I AT IMVLLC FALTLCSAFWWHNKGLAL I FC I LQSLALTWYSLS FI P FAR 
DAVKKC FAVCLA 

Important features: 
Signal peptide: 

amino acids 1-33 

Type II fibronectin collagen-binding domain protein. 

amino acids 30-72 
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FIGURE 57 



CGGCTCGAGCTC(^GCCGAATCGGCTCGAGGGGC^GTGGAGCACCCAGCAGGCCGCCAACATGCTCTGTCTGTGCCTG 
TACGTGCCGGTCATCGGGGAAGCCCAGACCGAGTTCCAGTACTTTGAGTCGAAGGGGCTCCCTGCCGAGCTGAAGTCC 
AT TT TCAAGCT CAGT GTCTT CATCCCCTCC CAGGAATTCTCCACCTACCGC CAGT GGAAGCAGAAAATT GT ACAAGCT 
GGAGATAAGGACCTTGATGGGCAGCTAGACTTTGAAGAATTTGTCCATTATCTCCAAGATCATGAGAAGAAGCTGAGG 
CT GGTGTTTAAGATTT T GGACAAAAAGAAT GATGGACGCATT GACGCGCAGGAGATCAT GCAGT CCCTGCGGGACTT G 
GGAGT CAAGATATCT GAACAGCAGGCAGAAAAAATT CTCAAGAGCAT GGATAAAAACGGCACGAT GACCAT CGACT GG 
AACGAGTGGAGAGACTACCyVCCTCCTCCACCCCGTGGAAAACATCCCCGAGATCATCCTCTACTGGAAGCATTCCACG 
ATCTTT GAT GT GGGT GAGAATCTAACGGT CCC GGATGAGT T CACAGT GGAGGAGAGGCAGACGGGGAT GT GGTGGAGA 
CACCTGGTGGCA.GGAGGTGGGGC^GGGGCCGTATCCAGAACCTGCACGGCCCCCCTGGACAGGCTCAAGGTGCTCATG 
CAGGTCCATGCCTCCCGCAGCAACAACATGGGCATCGTTGGTGGCTTCACTCAGATGATTCGAGAAGGAGGGGCCAGG 
T CACT CT GGC GGGGCAATGGCATC AAC GT CCT CAAAATT GCCCCCGAAT CAGCCATCAAATT CATGGCCT AT GAGCAG 
AT CAAGCGCCT T GTTGGTAGTGACCAGGAGACTCTGAGGATT CACGAGAGGCTT GT GGCAGGGT CCTT GGCAGGGGCC 
AT CGC CCAGAGCAGCAT CTAC CCAATGGAGGT CCT GAAGACCCGGATGGCGCTGC GGAAGACAGGCCAGTACTCAGGA 
AT GCT GGACT GCGCCAGGAGGATC CT GGCCAGAGAGGGGGTGGCC GCCTT CTACAAAGG CTAT GT CC CCAACAT GCT G 
GGCAT CATC CCCTAT GC CGGCATC GACCT T GCAGTCTACGAGACGCT CAAGAAT GCCT GGCT GCAG CACTAT GCAGT G 
AACAGCGCGGACCCCGGCGTGTTTGTGCTCCTGGCCTGTGGCACCATGTCCAGTACCTGTGGCCAGCTGGCCAGCTAC 
CCCCT GGCC CTAGT CAGGAC CCGGATG CAGGC GCAAGCCT CTATT GAGGGCGCTCCGGAGGT GAC CATGAGCAGCCT C 
TTCAAACATATCCTGCGGACCGAGGGGGCCTTCGGGCTGTACAGGGGGCTGGCCCCCAACTTCATGAAGGTCATCCCA 
GCTGTGAGCAT(^GCTACGTGGTCTACGAGAACCTGAAGATCACCCTGGGCGTGCAGTCGCGGTGACGGGGGGAGGGC 
CGCCCGGCAGTGGACTCGCTGATCCTGGGCCGCAGCCTGGGGTGTGCAGCCATCTCATTCTGTGAATGTGCCAACACT 
AAGCT GTCT CGAGCCAAGCT GTGAAAACCCTAGACGCAC CC GCAGGGAGGGT GGGGAGAGCT GGCAGGCC CAGGGCTT 
GTCCTGCTGACCCCAGCAGACCCTCCTGTTGGTTCCAGCGAAGACCACAGGCATTCCTTAGGGTCCAGGGTCAGCAGG 
CTCCGGGCTCACATGTGTAAGGACAGGACATTTTCTGCAGTGCCTGCCAATAGTGAGCTTGGAGCCTGGAGGCCGGCT 
TAGTTCTTCCATTTCACCCTTGCAGCCAGCTGTTGGCCACGGCCCCTGCCCTCTGGTCTGCCGTGCATCTCCCTGTGC 
CCTCTTGCTGCCTGCCTGTCTGCTGAGGTAAGGTGGGAGGAGGGCTACAGCCCACATCCCACCCCCTCGTCCAATCCC 
ATAATCCAT GAT GAAAGGT GAGGTCAC GT GGCCT CCCAGGCCTGACTTCCCAACCTACAGCATT GACGC CAACTTGGC 
TGTGAAGGAAGAGGAAAGGATCTGGCCTTGTGGTCACTGGCATCTGAGCCCTGCTGATGGCTGGGGCTCTCGGGCATG 
CTTGGGAGTGCAGGGGGCTCGGGCTGCCTGGCCTGGCTGCACAGAAGGCAAGTGCTGGGGCTCATGGTGCTCTGAGCT 
GGCCTGGACCCTGTCAGGATGGGCCCCACCTCAGAACCAAACTCACTGTCCCCACTGTGGCATGAGGGCAGTGGAGCA 
CCAT GTT T GAGGGCGAAGGGCAGAGCGTTT GT GT GTT CT GGG GAGGGAAGGAAAAGGTGTTGGAGGC CT TAAT TATGG 
ACT GTTGGGAAAAGGGTTT T GT CCAGAAG GACAAGCCGGACAAAT GAGCGACTT CTGTGCTT C CAGAGGAAGACGAGG 
GAGCAGGAGCTTGGCTGACTGCTCAGAGTCTGTTCTGACGCCCTGGGGGTTCCTGTCCAACCCCAGCAGGGGCGCAGC 
GGGAC CAGCC CCACATT C CACTT GT GT CACTGCTTGGAACCTATTTATT TT GTATTTAT TTGAACAGAGTTAT GT CCT 
AACTATTTTTATAGATTT GTTTAATTAATAGCTT GTCAT TTTCAAGTT CATTTTTTATT CATATTTAT GTTCAT GGTT 
GATTGTACCTTCCCAAGCCCGCCCAGTGGGATGGGAGGAGGAGGAGAAGGGGGGCCTTGGGCCGCTGCAGTCACATCT 
GTCCAGAGAAATTCCTTTTGGGACTGGAGGC^GAAAAGCGGCCAGAAGGCAGCAGCCCTGGCTCCTTTCCTTTGGCAG 
GT T GGGGAAGGGCTT GCCCCCAGC CTTAGGATTT CAGGGTTT GACTGGGGGCGT GGAGAGAGAG GGAGGAAC CTCAAT 
AAC CTTGAAGGT GGAAT C CAGTTATTT CCT GCGCT GCGAGGGTTT CT TTATTTCACTCT TTT CT GAAT GT CAAGGCAG 
TGAGGTGCCTCTCACTGTGAATTTGTGGTGGGCGGGGGCTGGAGGAGAGGGTGGGGGGCTGGCTCCGTCCCTCCCAGC 
CTTCTGCTGCCCTTGCTTAACAATGCCGGCCAACTGGCGACCTCACGGTTGCACTTCCATTCCACCAGAATGACCTGA 
TGAGGAAAT CTT CAATAGGATGCAAAGAT CAAT GCAAAAATT GTTATATATGAACATATAACT GGAGT C GTCAAAAAG 
CAAATTAAGAAAGAATT GGACGTT AGAAGT TGT CATTTAAAGCAGCCTT CTAATAAAGTT GTT T CAAAGCTGAAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 58 



MLCLCLYVPVIGEAQTEFQYFESKGLPAELKSIFKLSVFIPSQEFSTYRQWKQKIVQAGDKDLDG 
QLDFEE FVHYLQDHEKKLRLVFKILDKKNDGRI DAQE IMQS LRDLGVKI SEQQAEKILKSMDKNG 
TMTIDWNEWRDYHLLHPVENIPEIILYWKHSTIFDVGENLTVPDEFTVEERQTGMWWRHLVAGGG 
AGAVSRTCTAPLDRLKVMQVHASRSNNMGIVGGFTQMIREGGARSLWRGNGINVLKIAPESAIK 
FMAYEQIKRLVGSDQETLRIHERLVAGSLAGAIAQSSIYPMEVLKTRMALRKTGQYSGMLDCARR 
ILAREGVAAFYKGYVPNMLGI IPYAGI DMVYETLKNAWLQHYAVNSADPGVFVLLACGTMSSTC 
GQLASYPLALTOTRMQAQASIEGAPEVTMSSLFKHILRTEGAFGLYRGIAPNFMKVIPAVSISW 
VYENLKITLGVQSR 



Important features: 
Signal peptide: 



amino acids 1-16 



Putative transmembrane domains: 




amino acids 284-304, 339-360, 376-394 



Mitochondrial energy transfer proteins signature. 

amino acids 206-215, 300-309 



N-glycosylation sites. 

amino acids 129-133, 169-173 



Elongation Factor-hand calcium-binding protein. 

amino acids 54-73, 85-104, 121-140 
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FIGURE 



GGAAGGCAGCGGCAGCTCCACTCAGCCAGTACCCAGATACGCTGGGAACCTTCCCCAGCCATGGC 
T TCC CT GGGGCAGATCCTCT TCT GGAGCATAAT TAGCAT CAT CAT TAT T CT GGC TGGAGCAAT TG 
CACTCATCATTGGCTTTGGTATTTCAGGGAGACACTCCATCACAGTCACTACTGTCGCCTCAGCT 
GGGAACATTGGGGAGGATGGAATCCTGAGCTGCACTTTTGAACCTGACATCAA?VCTTTCTGATAT 
CGTGATACAAT GGCT GAAGGAAGGTGT T T TAGGC T TGGTCCATGAGT TCAAAGAAGGCAAAGATG 
AGCTGTCGGAGCAGGATGAAATGTTCAGAGGCCGGACAGCAGTGTTTGCTGATCAAGTGATAGTT 
GGCAATGCCTCTTTGCGGCTGAAAAACGTGCAACTCACAGATGCTGGCACCTACAAATGTTATAT 
CATCACTTCTAAAGGCAAGGGGAATGCTAACCTTGAGTATAAAACTGGAGCCTTCAGCATGCCGG 
AAGTGAAT GT GGAC TATAAT GCCAGC T CAGAGACC TT GCGG T GT GAGGCT CCCCGATGGT TCCCC 
CAGCCCACAGTGGTCT GGGCATCCCAAGTT GACCAGGGAGCCAAC T TC TCGGAAGT CTCCAATAC 
CAGCTTTGAGCTGAACTCTGAGAATGTGACCATGAAGGTTGTGTCTGTGCTCTACAATGTTACGA 
TCAACAACACATAC TCC TGTATGATT GAAAATGACAT T GCCAAAGCAACAGGGGATATCAAAGTG 
ACAGAATCGGAGATCAAAAGGCGGAGTCACCTACAGCTGCTAAACTCAAAGGCTTCTCTGTGTGT 
CTCTTCTTTCTTTGCCATCAGCTGGGCACTTCTGCCTCTCAGCCCTTACCTGATGCTAAAATAAT 
GTGCCT TGGCCACAAAAAAGCATGCAAAGT CAT TGTTACAACAGGGAT CTACAGAAC TAT TT CAC 
CACCAGATATGACCTAGTTTTATATTTCTGGGAGGAAATGAATTCATATCTAGAAGTCTGGAGTG 
AGCAAACAAGAGCAAGAAACAAAAAGAAGCCAAAAGCAGAAGGCTCCAATATGAACAAGATAAAT 
C TATCT TCAAAGACATATTAGAAGT TGGGAAAATAAT T CAT GTGAAC TAGACAAGT GTGT TAAGA 
GTGATAAGTAAAATGCACGTGGAGACAAGTGCATCCCCAGATCTCAGGGACCTCCCCCTGCCTGT 
CACCT GGGGAGTGAGAGGACAGGATAGTGCATGT TCT T T GT C TCT GAAT TT T TAGT TAT ATGT GC 
TGTAATGTTGCTCTGAGGAAGCCCCTGGAAAGTCTATCCCAACATATCCACATCTTATATTCCAC 
AAATTAAGCTGTAGTATGTACCCTAAGACGCTGCTAATTGACTGCCACTTCGCAACTCAGGGGCG 
GCTGCATTTTAGTAATGGGTCAAATGATTCACTTTTTATGATGCTTCCAAAGGTGCCTTGGCTTC 
TCT TCCCAAC TGACAAATGCCAAAGT TGAGAAAAATGAT CATAAT T T TAGCAT AAACAGAGCAGT 
CGGGGACACCGAT T TTATAAATAAAC T GAGCACC T TCT T T T TAAACAAAAAAAAAAAAAAAAAAA 
AAAAAAAMAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 60 



MASLGQILFWSIISIIIILAGAIALIIGFGISGRHSITVTTVASAGNIGEDGILSCTFEPDIKLS 

DIVIQWLKEGVLGLVHEFKEGKDELSEQDEMFRGRTAVFADQVIVGNASLRLKNVQLTDAGTYKC 

YIITSKGKGN7\NLEYKTGAFSMPEVNVDYNASSETLRCEAPRWFPQPTVVWASQVDQGANFSEVS 

NTSFELNSEOTTMKWSVLYNVTINNTY^ 

CVS S FFAI SWALLPLSPYLMLK 

Important features: 
Signal peptide: 

amino acids 1-28 

Transmembrane domain: 

amino acids 258-281 

N-glycosylation sites* 

amino acids 112-116, 160-164, 190-194, 196-200, 205-209, 216-220, 
220-224 

N-myristoylation sites, 

amino acids 52-58, 126-132, 188-194 
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FIGURE 6l 

TGACGTCAGAATCACCATGGCCAGCTATCCTTACCGGCAGGGCTGCCCAGGAGCTGCAGGACAAG 
CACCAGGAGCCCCTCCGGGTAGCTACTACCCTGGACCCCCCAATAGTGGAGGGCAGTATGGTAGT 
GGGCTACCCCCTGGTGGTGGTTATGGGGGTCCTGCCCCTGGAGGGCCTTATGGACCACCAGCTGG 
TGGAGGGCCCTATGGACACCCCAATCCTGGGATGTTCCCCTCTGGAACTCCAGGAGGACCATATG 
GCGGTGCAGCTCCCGGGGGCCCCTATGGTCAGCCACCTCCAAGTTCCTACGGTGCCCAGCAGCCT 
GGGCTTTATGGACAGGGTGGCGCCCCTCCCAATGTGGATCCTGAGGCCTACTCCTGGTTCCAGTC 
GGT GGACTCAGATCACAGTGGCTATAT CT CCAT GAAGGAGCTAAAGCAGGCCCTGGTCAACT GCA 
AT T GGTCT T CATTCAAT GAT GAGACCT GCC TCATGATGATAAACATGTT TGACAAGACCAAGT CA 
GGCCGCATCGATGTCTACGGCTTCTCAGCCCTGTGGAAATTCATCCAGCAGTGGAAGAACCTCTT 
C CAGCAGTAT GACCGGGACCGCTCGGGC TCCAT TAGCTACACAGAGCTGCAGCAAGCT CTGTCCC 
AAATGGGCTACAACCTGAGCCCCCAGTTGACCCAGCTTCTGGTCTCCCGCTACTGCCCACGCTCT 
GCCAAT CC T GCCATGCAGCT TGACCGCT TCATCCAGGT GT GCACCCAGCTGCAGGTGC T GACAGA 
GGCCTTCCGGGAGAAGGACACAGCTGTACAAGGCAACATCCGGCTCAGCTTCGAGGACTTCGTCA 
CCATGACAGCTTCTCGGATGCTATGACCCAACCATCTGTGGAGAGTGGAGTGCACCAGGGACCTT 
TCCTGGCTTCTTAGAGTGAGAGAAGTATGTGGACATCTCTTCTTTTCCTGTCCCTCTAGAAGAAC 
ATTCTCCCTTGCTTGATGCAACACTGTTCCAAAAGAGGGTGGAGAGTCCTGCATCATAGCCACCA 
AATAGT GAGGACCGGGGCTGAGGCCACACAGATAGGGGCCTGATGGAGGAGAGGATAGAAGT TGA 
ATGTCCTGATGGCCATGAGCAGTTGAGTGGCACAGCCTGGCACCAGGAGCAGGTCCTTGTAATGG 
AGTTAGTGTCCAGTCAGCTGAGCTCCACCCTGATGCCAGTGGTGAGTGTTCATCGGCCTGTTACC 
GTTAGTACCTGTGTTCCCTCACCAGGCCATCCTGTCAAACGAGCCCATTTTCTCCAAAGTGGAAT 
CTGACCAAGCATGAGAGAGATCTGTCTATGGGACCAGTGGCTTGGATTCTGCCACACCCATAAAT 
CCTTGTGTGTTAACTTCTAGCTGCCTGGGGCTGGCCCTGCTCAGACAAATCTGCTCCCTGGGCAT 
CTTTGGCCAGGCTTCTGCCCCCTGCAGCTGGGACCCCTCACTTGCCTGCCATGCTCTGCTCGGCT 
TCAGTCTCCAGGAGACAGTGGTCACCTCTCCCTGCCAATACTTTTTTTAATTTGCATTTTTTTTC 
AT T TGGGGC CAAAAGT C CAGT GAAAT T GTAAGC T TCAATAAAAGGATGAAAC TCTGA 
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FIGURE 62 



MASYPYRQGCPGAAGQAPGAPPGSYYPGPPNSGGQYGSGLPPGGGYGGPAPGGPYGPPAGGGPYG 
HPNPGMFPSGTPGGPYGGAAPGGPYGQPPPSSYGAQQPGLYGQGGAPPNVDPEAYSWFQSVDSDH 
SGYISMKELKQALWCNWSSFNDETCLMMI 

DRSGSISYTELQQALSQMGYNLSPQFTQLLVSRYCPRSANPAMQLDRFIQVCTQLQVLTEAFREK 
DTAVQGNIRLS FEDFVTMTASRML 

Important features of the protein: 
Signal peptide: 

amino acids 1-19 

N-glycosylation site. 

amino acids 147-150 

Casein kinase II phosphorylation sites. 

amino acids 135-138, 150-153, 202-205, 271-274 



o 

fU N-myristoylation sites. 



amino acids 9-14, 15-20, 19-24, 33-38, 34-39, 39-44, 43-48, 61- 
66, 70-75, 78-83, 83-88, 87-92, 110-115 
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FIGURE 63 



CAGGATGCAGGGCCGCGTGGCAGGGAGCTGCGCTCCTCTGGGCCTGCTCCTGGTCTGTCTTCATC 
TCCCAGGCCTCTTTGCCCGGAGCATCGGTGTTGTGGAGGAGAAAGTTTCCCAAAACTTCGGGACC 
AACTTGCCTCAGCTCGGACAACCTTCCTCCACTGGCCCCTCTAACTCTGAACATCCGCAGCCCGC 
TCTGGACCCTAGGTCTAATGACTTGGCAAGGGTTCCTCTGAAGCTCAGCGTGCCTCCATCAGATG 
GCTTCCCACCTGCAGGAGGTTCTGCAGTGCAGAGGTGGCCTCCATCGTGGGGGCTGCCTGCCATG 
GATTCCTGGCCCCCTGAGGATCCTTGGCAGATGATGGCTGCTGCGGCTGAGGACCGCCTGGGGGA 
AGCGCTGCCTGAAGAACTCTCTTACCTCTCCAGTGCTGCGGCCCTCGCTCCGGGCAGTGGCCCTT 
TGCCTGGGGAGTCTTCTCCCGATGCCACAGGCCTCTCACCTGAGGCTTCACTCCTCCACCAGGAC 
TCGGAGTCCAGACGACTGCCCCGTTCTAATTCACTGGGAGCCGGGGGAAAAATCCTTTCCCAACG 
CCCTCCCTGGTCTCTCATCCACAGGGTTCTGCCTGATCACCCCTGGGGTACCCTGAATCCCAGTG 
TGTCCTGGGGAGGTGGAGGCCCTGGGACTGGTTGGGGAACGAGGCCCATGCCACACCCTGAGGGA 
ATCTGGGGTATCAATAATCAACCCCCAGGTACCAGCTGGGGAAATATTAATCGGTATCCAGGAGG 
W CAGCTGGGGAAATATTAATCGGTATCCAGGAGGCAGCTGGGGGAATATTAATCGGTATCCAGGAG 

Z GCAGCTGGGGGAATATTCATCTATACCCAGGTATCAATAACCCATTTCCTCCTGGAGTTCTCCGC 

m CCTCCTGGCTCTTCTTGGAACATCCCAGCTGGCTTCCCTAATCCTCCAAGCCCTAGGTTGCAGTG 

is? | 

p GGGCTAGAGCACGATAGAGGGAAACCCAACATTGGGAGTTAGAGTCCTGCTCCCGCCCCTTGCTG 
I'll TGTGGGCTCAATCCAGGCCCTGTTAACATGTTTCCAGCACTATCCCCACTTTTCAGTGCCTCCCC 
G T GC TCAT CT CCAATAAAATAAAAGCACT TAT GAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 

AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA&AAAAAAAAA 



APP ID=1 0063569 



Page 215 of 320 



FIGURE 6 J. 

MQGRVAGSCAPLGLLLVCLHLPGLFARSIGWEEKVSQNFGTNLPQLGQPSSTGPSNSEHPQPAL 
DPRSNDLARVPLKLSVPPSDGFPPAGGSAVQRWPPSWGLPi^DSWPPEDPWQMMAAAAEDRLGEA 
LPEELSYLSSAAALAPGSGPLPGESSPDATGLSPEASLLHQDSESRRLPRSNSLGAGGKILSQRP 
PWSLIHRVLPDHPWGTLNPSVSWGGGGPGTGWGTREMPHPEGIWGINNQPPGTSWGNINRYPGGS 
WGNINRYPGGSWGNINRYPGGSWGNIHLYPGINNPFPPGVLRPPGSSWNIPAGFPNPPSPRLQWG 

Important features of the protein: 
Signal peptide: 

amino acids 1-26 

Casein kinase II phosphorylation sites. 

amino acids 56-59, 155-158 

N-myristoylation sites. 

amino acids 48-53, 220-225, 221-226, 224-229, 247-252, 258-263, 
259-264, 269-274, 270-275, 280-285, 281-286, 305-310 
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FIGURE 6 s 



AAGGAGAGGCCACCGGGACTTCAGTGTCTCCTCCATCCCAGGAGCGCAGTGGCCACTATGGGGTC 
TGGGCTGCCCCTTGTCCTCCTCTTGACCCTCCTTGGCAGCTCACATGGAACAGGGCCGGGTATGA 
CTTTGCAACTGAAGCTGAAGGAGTCTTTTCTGACAAATTCCTCCTATGAGTCCAGCTTCCTGGAA 
TTGCTTGAAAAGCTCTGCCTCCTCCTCCATCTCCCTTCAGGGACCAGCGTCACCCTCCACCATGC 
AAGATCTCAACACCATGTTGTCTGCAACACATGACAGCCATTGAAGCCTGTGTCCTTCTTGGCCC 
GGGCTTTTGGGCCGGGGATGCAGGAGGCAGGCCCCGACCCTGTCTTTCAGCAGGCCCCCACCCTC 
^ CTGAGTGGCAATAAATAAAATTCGGTATGCTd 

0 

#** 



yi 



iu 
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FIGURE 66 

MGSGLPLVLLLTLLGSSHGTGPGMTLQLKLKESFLTNSSYESSFLELLEKLCLLLHLPSGTSVTL 
HHARSQHHWCNT 

Important features: 
Signal peptide: 

amino acids 1-19 

N-glycosylation site. 

amino acids 37-41 

N-myristoylation sites. 

amino acids 15-21, 19-25, 60-66 
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FIGURE 67 



ACGGACCGAGGGTTCGAGGGAGGGACACGGACCAGGAACCTGAGCTAGGTCAAAGACGCCCGGGC 
CAGGTGCCCCGTCGCAGGTGCCCCTGGCCGGAGATGCGGTAGGAGGGGCGAGCGCGAGAAGCCCC 
TTCCTCGGCGCTGCCAACCCGCCACCCAGCCCATGGCGAACCCCGGGCTGGGGCTGCTTCTGGCG 
CTGGGCCTGCCGTTCCTGCTGGCCCGCTGGGGCCGAGCCTGGGGGCAAATACAGACCACTTCTGC 
AAATGAGAATAGCACTGTTTTGCCTTCATCCACCAGCTCCAGCTCCGATGGCAACCTGCGTCCGG 
AAGCCATCACTGCTATCATCGTGGTCTTCTCCCTCTTGGCTGCCTTGCTCCTGGCTGTGGGGCTG 
GCACTGTTGGTGCGGAAGCTTCGGGAGAAGCGGCAGACGGAGGGCACCTACCGGCCCAGTAGCGA 
GGAGCAGTTCTCCCATGCAGCCGAGGCCCGGGCCCCTCAGGACTCCAAGGAGACGGTGCAGGGCT 
GCCTGCCCATC TAGG TCCCCTCTCCTGCATCTGTCTCCCTTCATTGCTGTGTGACCTTGGGGAAA 
GGCAGTGCCCTCTCTGGGCAGTCAGATCCACCCAGTGCTTAATAGCAGGGAAGAAGGTACTTCAA 
AGACTCTGCCCC TGAGGTCAAGAGAGGAT GGGGCTAT T CAC T T TTATAT AT T TATATAAAAT TAG 
TAGTGAGAT GTAAAAAAAAAAAAAAAAAA 
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FIGURE 68 



MANPGLGLLLALGLPFLLARWGRAWGQIQTTSANENSTVLPSSTSSSSDGNLRPEAITAIIWFS 
LLAMjLIAVGLALLVRK^ 

Important features: 
Signal peptide: 

amino acids 1-19 

Transmembrane domain: 

amino acids 56-80 

N-glycosylation site. 

amino acids 36-40 



cAMP- and cGMP-dependent protein kinase phosphorylation site. 



ft 



Iff amino acids 8 6-90 



Tyrosine kinase phosphorylation site. 

amino acids 8 6-94 

N-myristoylation sites. 

amino acids 7-13, 26-32 
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FIGURE 6q 



GCCAGGAATAACTAGAGAGGAACAATGGGGTTATTCAGAGGTTTTGTTTTCCTCTTAGTTCTGTGCCTGCTGCACCAG 
T CAAATACTTCCTT CATTAAGCTGAATAATAATGGCT T TGAAGATAT TGT CATT GT TATAGATCCTAGT GT GCCAGAA 
GAT GAAAAAATAAT TGAACAAATAGAGGATAT GGT GACTACAGCTT CTAC GTACCT GTTT GAAGCCACAGAAAAAAGA 
TTTTTTTTWWWVTGTATCTATATTAATTCCTGAGAATTGGAAGGA 
AACCATAAAGATGCTGATGTTATAGTTGCAC 

GAAT GT GGAGAGAAAGGCGAATACAT T CACTT CACC CCT GACCT TCTACTT GGAAAAAAACAAAAT GAATATGGACCA 
C CAGGCAAACT GTT TGT CCAT GAGT GGGCTCACCTC CGGTGGGGAGTGTTTGAT GAGTACAAT GAAGAT CAGCCTTT C 
TAC CGT GCTAAGTCAAAAAAAATCGAAGCAACAAGGT GTTC CGCAGGTATCT CTGGTAGAAATAGAGTTTATAAGTGT 
CAAGGAGGCAGCT GTCTTAGTAGAGCAT GCAGAATT GATTCTACAACAAAACTGTATGGAAAAGAT T GT CAATTCTT T 
C CT GATAAAGTACAAACAGAAAAAGCATC CATAATGT TTAT GCAAAGTATTGAT T CTGTTGTTGAATTTTGTAACGAA 
AAAACC CATAAT CAAGAAGCT CCAAGCCTACAAAACATAAAGTGCAATTTTAGAAGTACAT GGGAGGT GATTAGCAAT 
f"' TCTGAGGATTTTAAAAACACCATACCCATGGTGACACCACCTCCTCCACCTGTCTTCTCATTGCTGAAGATCAGTCAA 
Q AGAATTGTGTGCTTAGTTCTTGATAAGTCTGGAAGCATGGGGGGTAAGGACCGCCTAAATCGAATGAATCAAGCAGCA 
|P| AAACATT T CCT GCT GCAGACT GTT GAAAATGGAT CCT GGGT GGGGAT GGTT CACTT TGATAGTACTGCCACTATTGTA 

AATAAGCTAAT CCAAATAAAAAGCAGT GATGAAAGAAACACACT CAT GGCAGGATTACCTACATAT CCT CT GGGAGGA 
IP ACTTCCIATCTGCTCTGGAATTAAATATGCATTTCAGGTGATTGGAGAGCTACATTCCCAACTCGATGGATCCGAAGTA 
f a | CTGCTGCTGACTGATGGGGAGGATAACACTGCAAGTTCTTGTATTGATGAAGTGAAACAAAGTGGGGCCATTGTTCAT 
?2 TTTATT GCTTT GGGAAGAGCTGCT GAT GAAGCAGTAATAGAGAT GAGC AAGATAACAGGAGGAAGTCAT TT TTATGT T 

ill T CAGAT GAAGCTCAGAACAAT GGCCT CATTGATGCTT TT GGGGCT CTTACAT CAGGAAATACTGAT CTCT CC CAGAAG 

ffl T CCCTT CAG CT C GAAAGTAAGGGATTAACACTGAATAGTAATGCCT GGATGAACGACACTGTCATAATT GATAGTACA 

GT GGGAAAG GACACGTT CTTT CT CAT CACAT GGAACAGTCT GCCT C C C AGTATTT CTCT CT GGGAT CCCAGT GGAACA 
%! ATAATGGAAAATTT CACAGTGGAT GCAACTT CCAAAAT GGCCTAT CT CAGTATT C CAGGAACTG CAAAGGT GGGCACT 

g TGGGC^TACIAATCTTCAAGCCAAAGCGAACCCAGAAACIATTAACTATTACIAG 

J.,, GT GCCT CCAAT CACAGT GAAT GCTAAAAT GAATAAGGACGTAAACAGTTT CCCCAGCCCAAT GAT T GTTTACGCAGAA 

13 ATTCTACAAGGATAT GTACCT GTT CT T GGAGCCAAT GTGACTGCTTTCATTGAAT CACAGAATGGACATACAGAAGTT 

If! TTGGAACTTTTGGATAATGGTGCAGGCGCTGATTCTTTCAAGAATGATGGAGTCTACTCCAGGTATTTTACAGCATAT 
J! ACAGAAAATGGC^GATATAGCTTAAAAGTTCGGGCTC^GGAGGAGCAAACACTG 

O CT GAATAGAGCCGCGTACATACCAGGCTGGGTAGTGAACGGGGAAAT TGAAGCAAACCCGCCAAGACCT GAA 

ft I GAGGATACTCAGACCAC CTTGGAG GATTT CAGCCGAACAGCATCCGGAGGTGCATTTGT GGTAT CACAAGT CCCAAGC 

CTTCCCTTGCCTGACCAATACCCACCAAGTCAAATCA 
|J ACATGGACAGCACCAGGAGATAATTTTGATGTTGGAAAAGTTCAACGTTATATCATAAGAATAAGTGCAAGTATTCTT 
j?f 1 GATCTAAGAGACAGTTTTGATGATGCTCTTCAAGTAAATACTACTGATCTGTCACCAAAGGAGGCCAACTCC^AGGAA 
* * AGCT TT GCAT TTAAACCAGAAAATAT CTCAGAAGAAAAT GCAAC CCACATATTTATT GCCAT TAAAAGTATAGATAAA 

AGCMTTTGACATCAAAAGTATCCAACATTGC^CAAGTAACTTTGTTTATCCCTCAAGCAAATCCTGATGACATTGAT 
C CTACACCTACTCCTACTCCTACT CCTACTC CTGATAAAAGT CATAATT CT G GAGTTAATAT TTCTACGCT GGTATT G 
T CT GT GATT GGGT CT GTT GTAATT GTTAACTTTATTTTAAGTACCACCATTTGAACCTTAACGAAGAAAAAAATCTT C 
AAGTAGAC CTAGAAGAGAGTTTTAAAAAACAAAACAAT GTAAGTAAAGGATATT TCT GAAT CTTAAAATTCATCCCAT 
GTGTGATCATAAACTCATAAAAATAATTTTAAGATGTCGGAAAAGGATACTTTGATTAAATAAAAACACTCATGGATA 
T GTAAAAACT GT CAAGAT TAAAATTTAATAGTTT CATT TATTT GTTATT TTATT T GTAAGAAATAGT GATGAACAAAG 
AT C CTTT TT CATACTGATACCT GGTT GTATATTATTT GATGCAACAGTTTTCT GAAAT GATAT TTCAAAT TGCAT CAA 
GAAATTAAAAT CAT CTAT CTGAGTAGT CAAAATACAAGTAAAGGAGAGCAAAT7W^CAACATT T GGAAAAAAA7\AAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAW 
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FIGURE 7Q 

MGLFRGFVFLLVLCLLHQSNTSFIKLNNNGFEDIVIVIDPSVPEDEKIIEQIEDMVTTASTYLFE 
ATEKRFFFKNVSILIPENWKENPQYKRPKHENHKHADVIVAPPTLPGRDEPYTKQFTECGEKGEY 
IHFTPDLLLGKKQNEYGPPGKLFVHEWAHLRWGVFDEYNEDQPFYRAKSKKIEATRCSAGISGRN 
RVYKCQGGSCLSRACRIDSTTKLYGKDCQFFPDKVQTEKASIMFMQSIDSWEFCNEKTHNQEAP 
SLQNIKCNFRSTWEVISNSEDFKNTIPMVTPPPPPVFSLLKISQRIVCLVLDKSGSMGGKDRLNR 
MNQAAKHFLLQTVENGSWGMVHFDSTATIVNKLIQIKSSDERNTmAGLPTYPLGGTSICSGIK 
YAFQVIGELHSQLDGSEVLLLTDGEDNTASSCIDEVKQSGAIVHFIALGRAADEAVIEMSKITGG 
P% S HFYVS DEAQNNGL I DAFGALTS GNTDLS QKS LQLESKGLT LNSNAWMNDT VI I DS TVGKDT FFL 

ll\ I TWNSLPPS I SLWDPSGT IMENFTVDATSKMAYLS I PGTAKVGTWAYNLQAKANPE TLT I TVTSR 

|y AANS SVPPI TVNAKMNKDVNS FPSPMIVYAE I LQGYVPVLGANVTAFI E S QNGHTEVLELLDNGA 

tf] GADSFKNDGVYSRYFTAYTENGRYSLKVRAHGGANTARLKLRPPLNRAAYIPGWVWGEIEANPP 
^ RPEIDEDTQTTLEDFSRTASGGAFWSQVPSLPLPDQYPPSQITDLDATVHEDKIILTWTAPGDN 
FDVGKVQRYI IRI SAS ILDLRDS FDDALQVNTTDLS PKEANSKES FAFKPENI SEENATHI FI AI 
: |3 KSIDKSNLTSKVSNIAQVTLFIPQANPDDIDPTPTPTPTPTPDKSHNSGVNISTLVLSVIGSWI 
MI VNFILSTTI 

5' 

tSJ Signal peptide: 

amino acids 1-21 

Putative transmembrane domains: 

amino acids 284-300, 617-633 

Leucine zipper pattern. 

amino acids 469-491, 476-498 

N-glycosylation site. 

amino acids 20-24, 75-79, 340-344, 504-508, 542-546, 588-592, 
628-632, 811-815, 832-836, 837-841, 852-856, 896-900 
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FIGURE 71 



5 



CTCCTTAGGTGGAAACCCTGGGAGTAGAGTACTGACAGCAAAGACCGGGAAAGACCATACGTCCCCGGGCAGGGGTGA 
CAACAG GT GTCATCTTTTT GAT CT CGT GT GT GGCTGCCTT CCTATTT CAAGGAAAGACGCCAAGGTAATTTT GACCCA 
GAGGAGCAATGATGTAGC CACCT CCTAAC CTT CCCT TCTTGAACCCC CAGTTAT GCCAGGATTTACTAGAGAGT GTCA 
ACTCAACCAGCAAGCGGCTCCTTCGGCTTAACTTGTGGTTGGAGGAGAGAACCTTTGTGGGGCTGCGTTCTCTTAGCA 
GT GCTCAGAAGTGACTT GCCTGAGGGT GGAC CAGAAGAAAGGAAAGGTCCCCTCTT GCTGT T GG CT GCACAT CAGGAA 
GGCTGT GAT GGGAATGAAGGTGAAAACTT GGAGATT T CACTT CAGT CATTGCTT CT GCCTGCAAGAT CATCCTTTAAA 
AGTAGAGAAGCTGCTCTGT GT GGTGGTTAACT CCAAGAGGCAGAACT C GTT CTAGAAGGAAATGGAT GCAAGCAG CT C 
CGGGGGCCCCAAACGCATGCTTCCTGTGGTCTAGCCCAGGGAAGCCCTTCCGTGGGGGCCCCGGCTTTGAGGGATGCC 
ACCGGTTCTGGACGCATGGCTGATTCCTGAATGATGATGGTTCGCCGGGGGCTGCTTGCGTGGATTTCCCGGGTGGTG 
GTT TTGCT GGT GCT CCT CT GCT GT GCTAT CT CTGT C CT GTACAT GTT GGC CT GCACCCCAAAAGGTGACGAGGAGCAG 
CT GGCACTGCCCAGGGC CAACAGCCCCAC GGGGAAGGAG GGGTACCAGGCCGTCCTT CAGGAGT GGGAGGAGCAGCAC 
CGCAACTACGTGAGCAGC CTGAAGCGGCAGAT CGCACAGCT CAAGGAGGAGCT GCAGGAGAGGAGT GAGCAGCT CAGG 
$*\ AATGGGCAGTACC^GCCAGCGATGCTGCTGGCCTGGGTCTGGACAGGAGCCCCCCAGAGAAAACCCAGGCCGACCTC 
CT GGCCT T CCTGCACT C GCAGGT GGACAAGGCAGAGGTGAATGCTGGCGTCAAGCT GGCCACAGAGTAT GCAGCAGT G 
fa CCT TTC GATAGCTTTACT CTACAGAAGGT GTACCAGCTGGAGACT GGCCTTACCCGCCACCC CGAGGAGAAGCCT GT G 

fH AGGAAGGACAAGCGGGATGAGTT GGT GGAAGC CATT GAAT CAGCCTT GGAGACCCT GAACAATCCTGCAGAGAACAGC 

fl CCCAAT CACCGTC CTTACACGGCCTCT GATTT CATAGAAGGGATCTACCGAACAGAAAGGGACA7\AGGGACATT GTAT 

III GAGCTCACCTTCAAAGGGGACCAGAAAC^CGAA 

GT GAAAAATGAAAAGCT CAACATGGCCAACAC GCTTATCAAT GTTAT CGTGCCT CTAGCAAAAAGGGTGGACAAGTT C 
CGGCAGTTCATGCAGAATTTCAGGGAGATGTGCATTGAGCAGGATGGGAGAGTCCATCTCACTGTTGTTTACTTTGGG 
AAAGAAGAAATAAATGAAGTCAAAGGAATACTTGAAAACACTTCCAAAGCTGCCA^ 

CAGCTGAATGGAGAATTT TCT C GGGGAAAGGGACTT GAT GTTGGAGC CCGCTT CT GGAAGGGAAGCAAC GT C CTT CT C 
TT T T TCT GT GAT GT GGACATCTACTT CACAT CTGAATT C CT CAATAC GTGT AGGCT GAATACACAGCCAGGGAAGAAG 
» GTAT TTTATCCAGT TCTT TTCAGT CAGTACAATCCT GGCATAATATACGGCCACCAT GATGCAGTCCCT CCCTT GGAA 

f*% CAGCAGCT GGT CATAAAGAAGGAAACT GGATTTT GGAGAGACTT TGGATTTGGGAT GACGT GTCAGTATCGGTCAGAC 

TT CATCAATATAGGTGGGTTTGAT CT GGACAT CAAAGGCTGGGGCGGAGAGGAT GT GCACCTTTATCGCAAGTATCTC 
II I CACAGCAACCT CATAGT GGTAC GGACGCCTGT GCGAGGACTCTTCCACCT CT GGCAT GAGAAGCGCT GCAT GGAC GAG 

f ; | CT GACCCCCGAGCAGTACAAGAT GTGCAT GCAGT CCAAGGCCAT GAACGAGGCAT C CCAC GGCCAGCTGGGCAT GCT G 

GTGTTCAGGCACGAGATAGAGGCTC^CCTTCGC^W^CAGAAAC^GAAGACAAGTAGCAAAA 
ill GAAGGATT GTGGGAGACACTTTT T CT TTC CTTTT GCAATTACTGAAAGTGGCTGCAACAGAGAAAAGACTTCCATAAA 

PI GGACGACAAAAGAATTGGACT GAT GGGTCAGAGATGAGAAAGCCT CC GATTT CT CTCTGTT GGGCTTTTTACAACAGA 

J- AATCAAAAT CT CC GCT TT GCCT GCAAAAGTAACC CAGT T GCACCCTGTGAAGT GT CT GACAAAGGCAGAAT GCTTGT G 

fll AGATTATAAGCCTAAT GGT GT GGAGGTTTTGATGGT GTTTACAATACACT GAGAC CT GTT GTTTTGT GT GCTCATT GA 

AATATT CAT GATTTAAGAGCAGT TTT GTAAAAAATT CATTAGCATGAAAGGCAAGCATATTT CT CCT CATAT GAAT GA 
GC CTAT CAG CAGGGCT CTAGTTT CTAGGAAT GCTAAAATAT CAGAAGGCAGGAGAGGAGATAGG CT TATTAT GATACT 
AGTGAGTACATTAAGTAAAATAAAAT GGACCAGAAAAGAAAAGAAACCATAAATATC GTGT CATATTT T CCCCAAGAT 
TAACCAAAAATAATCTGCTTATCTTTTTGGTTGTCCTTTTAACTGTCTCCGTTTTTTTCTTTTATTTAAAAATGCACT 
TTTTTTCCCTTGTGAGTTATAGTCTGCTTATTTAATTACCACTTTGCAAGCCTTACAAGAGAGCACAAGTTGGCCTAC 
AT T TTTATATTTTTTAAGAAGATACTTTGAGATGCAT TATGAGAACTTT CAGTT CAAAGCAT CAAATTGAT GCCATAT 
CCAAGGACATGCCAAATGCTGATTCTGTCAGGCIACTGAATGTCAGGCATTGAGACATAGGGAAGGAATGGTTTGTACT 
AATACAGACGTACAGATACTTT CT CT GAAGAGTATTT TC GAAGAGGAGCAACTGAACACT G GAGGAAAAGAAAATGAC 
ACTTTCT GCTT TACAGAAAAGGAAACT CATT CAGACT GGT GATAT C GTGATGTACCTAAAAGTCAGAAACCACAT TT T 
CT C CTCAGAAGTAGGGAC CGCTT T CTTACCT GTT TAAATAAACCAAAGTATACC GTGT GAACCAAACAAT CT CTTTT C 
AAAACAGGGTGCT CCTCCT GGCTT CTGGCTT CCATAAGAAGAAATGGAGAAAAATATATATATATATATATATATTGT 
GAAAGAT CAAT CCATCT GCCAGAATCTAGTGGGATGGAAGTTTT T GCTACAT GTTATCCACC CCAGGCCAGGTGGAAG 
TAACTGAATTATTTTTTAAATTAAGCAGTTCTACTCAAT CACCAAGAT GCT T CT GAAAATT GCATT TTATTACCATT T 
CAAACTATTTT TTAAAAATAAATACAGTTAACATAGAGT GGT TT CT T CATT CAT GT GAAAATTATTAGC CAGCACCAG 
ATGCAT GAGCTAATTAT CT CTTTGAGT CCTT GCTTCTGTTT GCT CACAGTAAACTCATTGTTTAAAAGCTT CAAGAAC 
AT T CAAGCT GT T GGTGT GTTAAAAAAT GCAT T GTATTGATTT GTACT GGTAGTT TATGAAATTTAAT TAAAACACAGG 
CCAT GAAT GGAAGGTGGTATTGCACAGCTAATAAAATAT GAT TTGT GGATAT GAA 



APP_ID=10063569 



Page 223 of 320 



FIGURE 72 

MMMVRRGLIAWISRVWL 

EQHRNYVSSLKRQIAQLKEELQERSEQLRNGQYQASDAAGLGLDRSPPEKTQADLLAFLHSQVDK 
AEVNAGVKLATEYAAVPFDSFTLQKVYQLETGLTRHPEEKPVRKDKRDELVEAIESALETLNNPA 
ENSPNHRPYTASDFIEGIYRTERDKGTLYELTFKGDHKHEFKRLILFRPFSPIMKVKNEKLNMAN 
TL INVI VPIAKRVDKFRQBMQNFREMCI EQDGRVHLT WYFGKEE INEVKG I LENTSKAANFRNF 
TFIQLNGEFSRGKGLDVGARFWKGSNVLLFFCDVDIYFTSEFLNTCRLNTQPGKKVFYPVLFSQY 
NPGIIYGHHDAVPPLEQQLVIKKETGFWRDFGFGMTCQYRSDFINIGGFDLDIKGWGGEDVHLYR 
KYLHSNL I WRT PVRGL FHLWHEKRCMDELT PEQ YKMCMQSKAMNEASHGQLGMLVFRHE IEAHL 
RKQKQKTSSKKT 

Important features: 
Signal peptide: 

amino acids 1-27 

N-glycosylation sites. 

amino acids 315-319, 324-328 

N-myristoylation sites . 

amino acids 96-102, 136-142, 212-218, 311-317, 339-345, 393-399 

Amidation site. 

amino acids 377-381 
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FIGURE 73 



GAGAC T GCAGAGGGAGATAAAGAGAGAGGGCAAAGAGGCAGCAAGAGAT TT GTCC TGGGGATCCA 
GAAACCCAT GATACCC TAC TGAACACCGAAT CCCC TGGAAGCCCACAGAGACAGAGACAGCAAGA 
GAAGCAGAGATAAATACACTCACGCCAGGAGCTCGCTCGCTCTCTCTCTCTCTCTCTCACTCCTC 
CCTCCCTCTCTCTCTGCCTGTCCTAGTCCTCTAGTCCTCAAATTCCCAGTCCCCTGCACCCCTTC 
CTGGGACACTATGTTGTTCTCCGCCCTCCTGCTGGAGGTGATTTGGATCCTGGCTGCAGATGGGG 
GTCAACACT GGACGTAT GAGGGCCCACATGGTCAGGACCAT TGGCCAGCCTC T TACCCT GAGTGT 
GGAAACAATGCCCAGTCGCCCATCGATATTCAGACAGACAGTGTGACATTTGACCCTGATTTGCC 
TGCTCTGCAGCCCCACGGATATGACCAGCCTGGCACCGAGCCTTTGGACCTGCACAACAATGGCC 
ACACAGTGCAACTCTCTCTGCCCTCTACCCTGTATCTGGGTGGACTTCCCCGAAAATATGTAGCT 
GCCCAGCTCCACCTGCACT GGGG TCAGAAAGGATCCCCAGGGGGGTCAGAACACCAGAT CAACAG 
TGAAGCCACAT T TGCAGAGCTCCACAT T GTACATTAT GACT CTGAT TCCTATGACAGCT T GAGTG 
AGGCTGCTGAGAGGCCTCAGGGCCTGGCTGTCCTGGGCATCCTAATTGAGGTGGGTGAGACTAAG 
AATATAGCT TATGAACACATTCTGAGTCACT TGCATGAAGTCAGGCATAAAGATCAGAAGACCTC 
AGTGCCTCCCTTCAACCTAAGAGAGCTGCTCCCCAAACAGCTGGGGCAGTACTTCCGCTACAATG 
GCTCGCTCACAACTCCCCCTTGCTACCAGAGTGTGCTCTGGACAGTTTTTTATAGAAGGTCCCAG 
ATTTCAATGGAACAGCTGGAAAAGCTTCAGGGGACATTGTTCTCCACAGAAGAGGAGCCCTCTAA 
GCTTCTGGTACAGAACTACCGAGCCCTTCAGCCTCTCAATCAGCGCATGGTCTTTGCTTCTTTCA 
TCCAAGCAGGATCCTCGTATACCACAGGTGAAATGCTGAGTCTAGGTGTAGGAATCTTGGTTGGC 
TGTCTCTGCCTTCTCCTGGCTGTTTATTTCATTGCTAGAAAGATTCGGAAGAAGAGGCTGGAAAA 
C CGAAAGAGTGTGGTC T TCACCT CAGCACAAGCCACGAC TGAGGCATAAAT T CCT TC TCAGATAC 
CATGGATGTGGATGACTTCCCTTCATGCCTATCAGGAAGCCTCTAAAATGGGGTGTAGGATCTGG 
CCAGAAACACTGTAGGAGTAGTAAGCAGAT GTCCT CC T T CCCC TGGACAT CT CT TAGAGAGGAAT 
GGACCCAGGCTGTCAT T CCAGGAAGAAC TGCAGAGCC T T CAGCC TCTCCAAACATGTAGGAGGAA 
ATGAGGAAATCGCTGTGTTGTTAATGCAGAGANCAAACTCTGTTTAGTTGCAGGGGAAGTTTGGG 
ATATACCCCAAAGTCCTCTACCCCCTCACTTTTATGGCCCTTTCCCTAGATATACTGCGGGATCT 
CTCCT TAGGATAAAGAGTT GCTGT TGAAGT TGTATAT T T TT GATCAATATAT T TGGAAAT TAAAG 
TTTCTGACTTT 
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FIGURE 74 



MLFSALLLEVIWILAADGGQHWTYEGPHGQDHWPASYPECGNNAQSPIDIQTDSVTFDPDLPALQ 
PHGYDQPGTEPLDLHNNGHTVQLSLPSTLYLGGLPRKYVAAQLHLHWGQKGSPGGSEHQINSEAT 
FAELHIVHYDSDSYDSLSEAAERPQGLAVLGILIEVGETKNIAYEHILSHLHEVRHKDQKTSVPP 
FNLRELLPKQLGQYFRYNGSLTTPPCYQSVLWTVFYRRSQISMEQLEKLQGTLFSTEEEPSKLLV 
QNYRALQPLNQRMVFASFIQAGSSYTTGEMLSLGVGILVGCLCLLLAVYFIARKIRKKRLENRKS 
WFT S AQAT TEA 

Q Important features of the protein: 

W Signal peptide: 

r ! amino acids 1-15 

|s| 

III 

If* Transmembrane domain: 

|[1 amino acids 291-310 

Si 

N-glycosylation site. 

1.11 

7*i amino acids 213-216 

Q Eukaryotic-type carbonic anhydrases proteins 

amino acids 197-245, 104-140, 22-69 
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FIGURE 75 

TGCCGCTGCCGCCGCTGCTGCTGTTGCTCCTGGCGGCGCCTTGGGGACGGGCAGTTCCCTGTGTC 
TCTGGTGGTTTGCCTAAACCTGCAAACATCACCTTCTTATCCATCAACATGAAGA ATGT CCTACA 
AT GGACTCCACCAGAGGGT CT TCAAGGAGT TAAAGTT AC TTACAC TGTGCAGTAT T TCAT CACAA 
ATTGGCCCACCAGAGGTGGCACTGACTACAGATGAGAAGTCCATTTCTGTTGTCCTGACAGCTCC 
AGAGAAG T G G AAGAGAAAT C CAGAAGAC CTTCCTGTTT CCAT GCAAC AAAT AT AC T C CAAT C T GA 
AGTATAACGTGTCTGTGTTGAATACTAAATCAAACAGAACGTGGTCCCAGTGTGTGACCAACCAC 
ACGCTGGTGCTCACCTGGCTGGAGCCGAACACTCTTTACTGCGTACACGTGGAGTCCTTCGTCCC 
AGGGCCCCCTCGCCGTGCTCAGCCTTCTGAGAAGCAGTGTGCCAGGACTTTGAAAGATCAATCAT 
CAGAGTTCAAGGCTAAAATCATCTTCTGGTATGTTTTGCCCATATCTATTACCGTGTTTCTTTTT 
T C TGTGAT GGGC TAT T CCAT C TACCGATATAT CCACGT TGGCAAAGAGAAACACCCAGCAAAT T T 
GAT T TT GAT T TATGGAAATGAAT T TGACAAAAGAT TC T T TGTGCC TGCT GAAAAAATCG T GATTA 
AC T T TATCACCCTCAATATCTCGGATGATTCTAAAAT T T CT CATCAGGATAT GAGT T TAC TGGGA 
AAAAGCAGTGATGTATCCAGCCTTAATGATCCTCAGCCCAGCGGGAACCTGAGGCCCCCTCAGGA 
GGAAGAGGAGGTGAAACATT TAGGGTATGCT T CGCAT T TGAT GGAAATT TT T TGTGACTC TGAAG 
AAAACACGGAAGGTACT TCT CTCACCCAGCAAGAGTCCCT CAGCAGAACAATACCCCCGGATAAA 
ACAGTCAT TGAATATGAATATGAT GT CAGAACCAC TGACAT T TGT GCGGGGCC TGAAGAGCAGGA 
GCTCAGTTTGCAGGAGGAGGTGTCCACACAAGGAACAT TAT TGGAGTCGCAGGCAGCGT TGGCAG 
TCTTGGGCCCGCAAACGTTACAGTACTCATACACCCCTCAGCTCCAAGACTTAGACCCCCTGGCG 
CAGGAGCACACAGACTCGGAGGAGGGGCCGGAGGAAGAGCCATCGACGACCCTGGTCGACTGGGA 
TCCCCAAACTGGCAGGCTGTGTATTCCTTCGCTGTCCAGCTTCGACCAGGATTCAGAGGGCTGCG 
AGCCT T CT GAGGGGGAT GGGC TCGGAGAGGAGGGTCT T CTAT CTAGACTCTAT GAGGAGCCGGC T 
CCAGACAGGCCAC CAGGAGAAAATGAAACCTATC TCAT GCAATTCATGGAGGAATGGGGG TTATA 
TGTGCAGATGGAAAAC TGA TGCCTVACACTTCCTTTTGCCTTTTGTTTCCTGTGCAAACAAGTGAG 
TCACCCCTTTGATCCCAGCCATAAAGTACCTGGGATGAAAGAAGTTTTTTCCAGTTTGTCAGTGT 
CTGTGAGAATTACTTATTTCTTTTCTCTATTCTCATAGCACGTGTGTGATTGGTTCATGCATGTA 
GGTCTCTTAACAATGATGGTGGGCCTCTGGAGTCCAGGGGCTGGCCGGTTGTTCTATGCAGAGAA 
AGCAGT CAATAAATG T T TGCCAGACT GGGT GCAGAAT T TAT TCAGGTGGGT GT 
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FIGURE 76 



MSYNGLHQRVFKELKLLTLCSISSQIGPPEVALTTDEKSISWLTAPEKWKRNPEDLPVSMQQIY 
SNLKYWSVLNTKSNRTWSQCVTNHTLVLTWLEPNTLYCVHVESFVPGPPRRAQPSEKQCARTLK 
DQSSEFKAKIIFWYVLPISITVFLFSVMGYSIYRYIHVGKEKHPANLILIYGNEFDKRFFVPAEK 
IVINFITLNISDDSKISHQDMSLLGKSSDVSSLNDPQPSGNLRPPQEEEEVKHLGYASHLMEIFC 
DSEENTEGTSLTQQESLSRTIPPDKTVIEYEYDVRTTDICAGPEEQELSLQEEVSTQGTLLESQA 
ALAVLGPQTLQYSYTPQLQDLDPLAQEHTDSEEGPEEEPSTTLVDWDPQTGRLCIPSLSSFDQDS 
EGCEPSEGDGLGEEGLLSRLYEEPAPDRPPGENETYLMQFMEEWGLYVQMEN 

Important features: 
Signal peptide: 

amino acids 1-28 

Transmembrane domain: 

amino acids 140-163 

N-glycosylation sites. 

amino acids 71-74, 80-83, 89-92, 204-207, 423-426 
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FIGURE 77 

GAGGAGCGGGCCGAGGACTCCAGCGTGCCCAGGTCTGGCATCCTGCACTTGCTGCCCTCTGACAC 
CTGGGAAGATGGCCGGCCCGTGGACCTTCACCCTTCTCTGTGGTTTGCTGGCAGCCACCTTGA.TC 
CAAGCCACCCTCAGTCCCACTGCAGTTCTCATCCTCGGCCCAAAAGTCATCAAAGAAAAGCTGAC 
ACAGGAGCTGAAGGACCACAACGCCACCAGCATCCTGCAGCAGCTGCCGCTGCTCAGTGCCATGC 
GGGAAAAGCCAGCCGGAGGCATCCCTGTGCTGGGCAGCCTGGTGAACACCGTCCTGAAGCACATC 
ATC TGGC TGAAGGTCAT CACAGCTAACATCCTCCAGCT GCAGGTGAAGCCC T CGGCCAAT GACCA 
M GGAGCTGCTAGTCAAGATCCCCCTGGACATGGTGGCTGGATTCAACACGCCCCTGGTCAAGACCA 

2 TCGTGGAGTTCCACATGACGACTGAGGCCCAAGCCACCATCCGCATGGACACCAGTGCAAGTGGC 
III CC.CACCCGCCTGGTCCTCAGTGACTGTGCCACCAGCCATGGGAGCCTGCGCATCCAACTGCTGTA 

3 t I TAAGCTCTCCTTCCTGGTGAACGCCTTAGCTAAGCAGGTCATGAACCTCCTAGTGCCATCCCTGC 
Ifl CCAATCTAGTGAAAAACCAGCTGTGTCCCGTGATCGAGGCTTCCTTCAATGGCATGTATGCAGAC 
ff* CTCCTGCAGCTGGTGAAGGTGCCCATTTCCCTCAGCATTGACCGTCTGGAGTTTGACCTTCTGTA 

TCCTGCCATCAAGGGTGACACCATTCAGCTCTACCTGGGGGCCAAGTTGTTGGACTCACAGGGAA 
Fi AGGTGACCAAGT GGT T CAATAAC TC TGCAGCT TCCCT GACAATGCCCACCC TGGACAACATCCCG 

Iff TTCAGCCTCATCGTGAGTCAGGACGTGGTGAAAGCTGCAGTGGCTGCTGTGCTCTCTCCAGAAGA 
CI ATTCATGGTCCTGTTGGACTCTGTGCTTCCTGAGAGTGCCCATCGGCTGAAGTCAAGCATCGGGC 
[Y* TGATCAATGAAAAGGCTGCAGATAAGCTGGGATCTACCCAGATCGTGAAGATCCTAACTCAGGAC 
Sjt ACTCCCGAGTTTTTTATAGACCAAGGCCATGCCAAGGTGGCCCAACTGATCGTGCTGGAAGTGTT 
TCCCTCCAGTGAAGCCCTCCGCCCTTTGTTCACCCTGGGCATCGAAGCCAGCTCGGAAGCTCAGT 
TT TACACCAAAGGTGACCAACT TATACT CAACT TGAATAACATCAGC T CT GATCGGATCCAGC TG 
ATGAAC TCT GGGAT TGGCT GGT T CCAACCTGATGT TCT GAAAAACATCATCACTGAGAT CAT C CA 
CTCCATCCTGCTGCCGAACCAGAATGGCAAATTAAGATCTGGGGTCCCAGTGTCATTGGTGAAGG 
CCTTGGGATTCGAGGCAGCTGAGTCCTCACTGACCAAGGATGCCCTTGTGCTTACTCCAGCCTCC 
TTGTGGAAACCCAGCTCTCCTGTCTCCCAGTGAAGACTTGGATGGCAGCCATCAGGGAAGGCTGG 
GTCCCAGCTGGGAGTATGGGTGTGAGCTCTATAGACCATCCCTCTCTGCAATCAATAAACACTTG 
CCTGTGAAAAA 
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FIGURE 78 



MAGPWTFTLLCGLIAATLIQATLSPTAVLILGPKVIKEKLTQELKDHNATSILQQLPLLSAMREK 
PAGGIPVLGSLWTVLKHIIWLKVITANILQLQVKPSANDQELLVKIPLDMVAGFNTPLVKTIVE 
FHMTTEAQATIRMDTSASGPTRLVLSDCATSHGSLRIQLLYKLSFLVNALAKQVMNLLVPSLPNL 
VKNQLCPVIEASFNGMYADLLQLVKVPISLSIDRLEFDLLYPAIKGDTIQLYLGAKLLDSQGKVT 
KWFNNSAASLTMPTLDNIPFSLIVSQDVVKAAVAAVLSPEEFMVLLDSVLPESAHRLKSSIGLIN 
EKAADKLGSTQIVKILTQDTPEFFIDQGHAKVAQLIVLEVFPSSEALRPLFTLGIEASSEAQFYT 
KGDQLI LNLNNI S SDRIQLMNSGIGWFQPDVLKNI I TE I IHS ILLPNQNGKLRSGVPVSLVKALG 



O FEAAESSLTKDALVLTPASLWKPSSPVSQ 



Important features of the protein: 
Signal peptide: 

amino acids 1-21 

N-glycosylation sites. 

amino acids 48-51, 264-267, 401-404 



fy Glycosaminoglycan attachment site. 

O amino acids 412-415 

SPSS 



LBP / BPI / CETP family proteins. 

amino acids 407-457 
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FIGURE 7Q 



GAGAGAAGT CAGCC TGGCAGAGAGAC TCTGAAATGAGGGAT TAGAGGT GTTCAAGGAGCAAGAGC 
TTCAGCCTGAAGACAAGGGAGCAGTCCCTGAAGACGCTTCTACTGAGAGGTCTGCC ATG GCCTCT 
CTTGGCCTCCAACTTGTGGGCTACATCCTAGGCCTTCTGGGGCTTTTGGGCACACTGGTTGCCAT 
GCTGCTCCCCAGCTGGAAAACAAGTTCTTATGTCGGTGCCAGCATTGTGACAGCAGTTGGCTTCT 
CCAAGGGCCTCTGGATGGAATGTGCCACACACAGCACAGGCATCACCCAGTGTGACATCTATAGC 
ACCCTTCTGGGCCTGCCCGCTGACATCCAGGCTGCCCAGGCCATGATGGTGACATCCAGTGCAAT 
1^ CTCCTCCCTGGCCTGCATTATCTCTGTGGTGGGCATGAGATGCACAGTCTTCTGCCAGGAATCCC 
Q GAGCCAAAGACAGAGTGGCGGTAGCAGGTGGAGTCTTTTTCATCCTTGGAGGCCTCCTGGGATTC 
U ATTCCTGTTGCCTGGAATCTTCATGGGATCCTACGGGACTTCTACTCACCACTGGTGCCTGACAG 
~fl CATGAAATTTGAGATTGGAGAGGCTCTTTACTTGGGCATTATTTCTTCCCTGTTCTCCCTGATAG 
^| CTGGAATCATCCTCTGCTTTTCCTGCTCATCCCAGAGAAATCGCTCCAACTACTACGATGCCTAC 
m CAAGCCCAACCTCTTGCCACAAGGAGCTCTCCAAGGCCTGGTCAACCTCCCAAAGTCAAGAGTGA 
y;l GTTCAATTCCTACAGCCTGACAGGGTATGTGTGAAGAACCAGGGGCCAGAGCTGGGGGGTGGCTG 
* GGTCT GTGAAAAACAGT GGACAGCACCCCGAGGGCCACAGGT GAGGGACACTACCACTGGAT CGT 

13 

gtcagaaggtgctgctgaggatagactgactttggccattggattgagcaaaggcagaaatgggg 
PI gctagtgtaacagcatgcaggttgaattgccaaggatgctcgccatgccagcctttctgttttcc 
HI tcaccttgctgctcccctgccctaagtccccaaccctcaacttgaaaccccattcccttaagcca 
CI ggactcagaggatccctttgccctctggtttacctgggactccatccccaaacccactaatca^ 
tcccactgactgaccctctgtgatcaaagaccctctctctggctgaggttggctcttagctcatt 
gctggggatgggaaggagaagcagtggcttttgtgggcattgctctaacctacttctcaagcttc 
cctccaaagaaactgattggccctggaacctccatcccactcttgttatgactccacagtgtcca 
gactaatttgtgcatgaactgaaataaaaccatcctacggtatccagggaacagaaagcaggatg 
caggatgggaggacaggaaggcagcctgggacatttaaaaaaata 
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FIGURE 80 

MASLGLQLVGYILGLLGLLGTLVAMLLPSWKTSSYVGASIVTAVGFSKGLWMECATHSTGITQCD 
I YSTLLGLPADIQAAQAMMVTS SAI S S LAC 1 1 S WGMRCTVFCQESRAKDRVAVAGGVFFILGGL 
LGFIPVAWNLHGILRDFYSPLVPDSMKFEIGEALYLGIISSLFSLIAGIILCFSCSSQRNRSNYY 
DAYQAQPLATRSSPRPGQPPKVKSEFNSYSLTGYV 

Important features of the protein: 
Signal peptide: 

amino acids 1-24 

Transmembrane domains: 

amino acids 82-102, 117-140, 163-182 

N -glycosylate. on site. 

amino acids 190-193 

PMP-22 / EMP / MP20 family proteins. 

amino acids 4 6-59 
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FIGURE 81 



CCCACGCGTCCGCGCCTCTCCCTTCTGCTGGACCTTCCTTCGTCTCTCCATCTCTCCCTCCTTTC 
CCCGCGTTCTCTTTCCACCTTTCTCTTCTTCCCACCTTAGACCTCCCTTCCTGCCCTCCTTTCCT 
GCCCACCGCTGCTTCCTGGCCCTTCTCCGACCCCGCTCTAGCAGCAGACCTCCTGGGGTCTGTGG 
GTTGATCTGTGGCCCCTGTGCCTCCGTGTCCTTTTCGTCTCCCTTCCTCCCGACTCCGCTCCCGG 
ACCAGCGGCCTGACCCTGGGGAAAGGATGGTTCCCGAGGTGAGGGTCCTCTCCTCCTTGCTGGGA 
CTCGCGCTGCTCTGGTTCCCCCTGGACTCCCACGCTCGAGCCCGCCCAGACATGTTCTGCCTTTT 
!Z CCATGGGAAGAGATACTCCCCCGGCGAGAGCTGGCACCCCTACTTGGAGCCACAAGGCCTGATGT 
S . ACTGCCTGCGCTGTACCTGCTCAGAGGGCGCCCATGTGAGTTGTTACCGCCTCCACTGTCCGCCT 

ft% GTCCACTGCCCCCAGCCTGTGACGGAGCCACAGCAATGCTGTCCCAAGTGTGTGGAACCTCACAC 
||| TCCCTCTGGACTCCGGGCCCCACCAAAGTCCTGCCAGCACAACGGGACCATGTACCAACACGGAG 
Us AGATCTTCAGTGCCCATGAGCTGTTCCCCTCCCGCCTGCCCAACCAGTGTGTCCTCTGCAGCTGC 

m 

%l ACAGAGGGCCAGATC TACTGCGGCC TCACAACCT GCCCCGAACCAGGCT GCCCAGCACCCCT CCC 

AC TGCCAGACTCC TGCT GCCAAGCC TGCAAAGAT GAGGCAAGTGAGCAATCGGAT GAAGAGGACA 
f : GTGTGCAGTCGCT CCAT GGGGTGAGACATCCTCAGGAT CCAT GT TCCAGTGATGC T GGGAGAAAG 

If! AGAGGCCCGGGCACCCCAGCCCCCACTGGCCTCAGCGCCCCTCTGAGCTTCATCCCTCGCCACTT 
CAGACCCAAGGGAGCAGGCAGCACAACT GT CAAGATCGTCC T GAAGGAGAAACATAAGAAAGCCT 
|r GTGTGCATGGCGGGAAGACGTACTCCCACGGGGAGGTGTGGCACCCGGCCTTCCGTGCCTTCGGC 
||| CCCTTGCCCTGCATCCTATGCACCTGTGAGGATGGCCGCCAGGACTGCCAGCGTGTGACCTGTCC 
CACCGAGTACCCCTGCCGTCACCCCGAGAAAGTGGCTGGGAAGTGCTGCAAGATTTGCCCAGAGG 
ACAAAGCAGACCCTGGCCACAGTGAGATCAGTTCTACCAGGTGTCCCAAGGCACCGGGCCGGGTC 
CTCGTCCACACATCGGTATCCCCAAGCCCAGACAACCTGCGTCGCTTTGCCCTGGAACACGAGGC 
CTCGGACTTGGTGGAGATCTACCTCTGGAAGCTGGTAAAAGATGAGGAAACTGAGGCTCAGAGAG 
GT GAAGTACCTGGCCCAAGGCCACACAGCCAGAAT CT TCCAC TT GACT CAGATCAAGAAAGTCAG 
GAAGCAAGACTTCCAGAAAGAGGCACAGCACTTCCGACTGCTCGCTGGCCCCCACGAAGGTCACT 
GGAACGTCT T CC TAGC CCAGACCC TGGAGCT GAAGGT CACGGCCAGTCCAGACAAAGT GACCAAG 
ACATAACAAAGACC TAACAGTT GCAGATATGAGCT GTATAAT TGT T GT TATTATATAT TAATAAA 
TAAGAAGT T GCAT TACCC T CAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 82 



i 



MVPEWVLSSLLGIJUjLWFPLDSHARARPDMFCLFHGKRYSPGESWHPYLEPQGLMYCLRCTCSE 
GAHVSCYRLHCPPVHCPQPVTEPQQCCPKCVEPHTPSGLRAPPKSCQHNGTMYQHGEIFSAHELF 
PSRLPNQCVLCSCTEGQIYCGLTTCPEPGCPAPLPLPDSCCQACKDEASEQS DEEDS VQSLHGVR 
HPQDPCSSDAGRKRGPGTPAPTGLSAPLSFIPRHFRPKGAGSTTVKIVLKEKHKKACVHGGKTYS 
HGEVWHPAFRAFGPLPCILCTCEDGRQDCQRVTCPTEYPCRHPEKVAGKCCKICPEDKADPGHSE 
I SSTRCPKAPGRVLVHTSVS PS PDNLRRFALEHEASDLVE I YLWKLVKDEETEAQRGEVPGPRPH 
SQNLPLDSDQESQEARLPERGTALPTARWPPRRSLERLPSPDPGAEGHGQSRQSDQDITKT 

Signal peptide: 

amino acids 1-25 
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FIGURE 83 



GACAGCTGTGTCTCGATGGAGTAGACTCTCAGAACAGCGCAGTTTGCCCTCCGCTCACGCAGAGCCTCTCC 
GTGGCTTCCGCACCTTGAGCATTAGGCCAGTTCTCCTCTTCTCTCTAATCCATCCGTCACCTCTCCTGTCA 
TCCGTTTCCATGCCGTGAGGTCCATTCACAGAACACATCCATGGCTCTCATGCTCAGTTTGGTTCTGAGTC 
TCCTCAAGCTGGGATCAGGGCAGTGGCAGGTGTTTGGGCCAGACAAGCCTGTCCAGGCCTTGGTGGGGGAG 
GACGCAGCATTCTCCTGTTTCCTGTCTCCTAAGACCAATGCAGAGGCCATGGAAGTGCGGTTCTTCAGGGG 
CCAGTTCTCTAGCGTGGTCCACCTCTACAGGGACGGGAAGGACCAGCCATTTATGCAGATGCCACAGTATC 
AAGGCAGGAC AAAACT GGTGAAGGATTC T ATTGC GG AGGGGC GC AT CT CTCT GAGGCTGGAAAACATT ACT 
GTGTTGGATGCTGGCCTCTATGGGTGCAGGATTAGTTCCCAGTCTTACTACCAGAAGGCCATCTGGGAGCT 
ACAGGTGTCAGCACTGGGCTCAGTTCCTCTCATTTCCATCACGGGATATGTTGATAGAGACATCCAGCTAC 
TCTGTCAGTCCTCGGGCTGGTTCCCCCGGCCCACAGCGAAGTGGAAAGGTCCACAAGGACAGGATTTGTCC 
AC AGACTCCAGGAC AAAC AG AGAC AT GC AT GGCCT GTTTGAT GTGGAGATCT CT CT GACCGTCCAAGAGAA 
Iff CGCCGGGAGCATATCCTGTTCCATGCGGCATGCTCATCTGAGCCGAGAGGTGGAATCCAGGGTACAGATAG 
Ol GAGATACCTTTTTCGAGCCTATATCGTGGCACCTGGCTACCAAAGTACTGGGAATACTCTGCTGTGGCCTA 
W TTTTTTGGCATTGTTGGACTGAAGATTTTCTTCTCCAAATTCCAGTGGAAAATCCAGGCGGAACTGGACTG 
'I,.. GAGAAGAAAGCACGGACAGGCAGAAT T GAGAGACGCCCGGAAAC ACGC AGT GGAGGTGACTCTGGATC CAG 

n 

J:! AGAC GGCT CACCCGAAGCTCTGCGTT T CTGAT CT GAAAACTGTAACCC AT AGAAAAGCTC CCCAGG AGGTG 

IP I 

CCTC ACTCTGAGAAGAGATT TACAAGGAAGAGT GTGGTGGCT TCTC AGAGTT TCCAAGCAGGGAAACATT A 

u 

s7Sii CTGGGAGGTGGACGGAGGACACAATAAAAGGTGGCGCGTGGGAGTGTGCCGGGATGATGTGGACAGGAGGA 
AGGAGT ACGTGACT TT GTCT CCCGATC ATGGGT ACT GGGT CCTC AGAC TGAATGGAGAACATTT GTATTT C 

fll ACATTAAATCCCCGTT TT AT CAGCGTCTTC CCCAGGACCC CACCT ACAAAAAT AGGGGTC TTCCTGGACT A 

TGAGTGTGGGACCATCTCCTTCTTCAACATAAATGACCAGTCCCTTATTTATACCCTGACATGTCGGTTTG 
AAGGCTTATTGAGGCC CT AC AT T GAGT ATCCGTCCT AT AATGAGCAAAATGG AACTCCCATAGT CATCTG C 
CCAGTC ACCC AGGAATCAGAGAAAGAGGCCTC TT GGCAAAGGGCCTCT GCAATCCCAGAG ACAAGCAACAG 
TGAGTCCT CCTC AC AGGCAACC ACGCCCTT CCT CCC CAGGGGT GAAAT G TAGG ATGAATCACAT CCCACAT 
TCTTCTTTAGGGATATTAAGGTCTCTCTCCCAGATCCAAAGTCCCGCAGCAGCCGGCCAAGGTGGCTTCCA 
GATGAAGGGGGACTGGCCTGTCCACATGGGAGTCAGGTGTCATGGCTGCCCTGAGCTGGGAGGGAAGAAGG 
CT GACATT AC AT TT AGTT TGCT CT C ACT CC AT CTGGCT AAGT GATCTT GAAAT ACCACCT CTCAGGTGAAG 
AACCGTCAGGAATTCCCATCTCACAGGCTGTGGTGTAGATTAAGTAGACAAGGAATGTGAATAATGCTTAG 
AT CTTATT GATGAC AGAGTGTAT C CT AATGGTTT GT TCAT TATATT AC ACT TTCAGTAAAAAAA 
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FIGURE 8a 



MMiMLSLVLSLLKLGSGQWQVFGPDKPVQALVGEDAAFSCFLSPKTNAEAMEVRFFRGQFSSWH 
LYRDGKDQPFMQMPQYQGRTKLVKDSIAEGRISLRLENITVLDAGLYGCRISSQSYYQKAIWELQ 
VSALGS VPL I S I TGYVDRDI QLLCQS SGWFPRPTAKWKGPQGQDLSTDSRTNRDMHGLFDVE I SL 
TVQENAGS I SCSMRHAHLSREVESRVQI GDT FFEP I SWHLATKVLGI LCCGLFFGI VGLKI FFSK 
FQWKIQAELDWRRKHGQAELRDARKHAVEVTLDPETAHPKLCVSDLKTVTHRKAPQEVPHSEKRF 
TRKSWASQSFQAGKHYWEVDGGHNKRWRVGVCRDDVDRRKEYVTLSPDHGYWVLRLNGEHLYFT 
LNPRFISVFPRTPPTKIGVFLDYECGTISFFNINDQSLIYTLTCRFEGLLRPYIEYPSYNEQNGT 
PIVICPVTQESEKEASWQRASAIPETSNSESSSQATT.PFLPRGEM 



r 3 Signal peptide: 

||| amino acids 1-17 

CJ! 

Ssii Transmembrane domain: 

* amino acids 239-255 
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FIGURE 8* 



AACAGACGTTCCCTCGCGGCCCTGGCACCTCTAACCCCAGACATGCTGCTGCTGCTGCTGCCCCT 
GC TCT GGGGGAGGGAGAGGGCGGAAGGACAGACAAGTAAAC TGC TGACGAT GCAGAGTT CCGT GA 
CGGTGCAGGAAGGCCTGTGTGTCCATGTGCCCTGCTCCTTCTCCTACCCCTCGCATGGCTGGATT 
TACCCTGGCCCAGTAGTTCATGGCTACTGGTTCCGGGAAGGGGCCAATACAGACCAGGATGCTCC 
AGTGGCCACAAACAACCCAGCTCGGGCAGTGTGGGAGGAGACTCGGGACCGATTCCACCTCCTTG 
GGGACCCACATACCAAGAAT TGCACCCT GAGCATCAGAGAT GCCAGAAGAAGTGATGCGGGGAGA 
I™ TACT T CTT TCGTATGGAGAAAGGAAGTATAAAATGGAAT TATAAACAT CACCGGC TCTCT GTGAA 

1% TGTGACAGCCTTGACCCACAGGCCCAACATCCTCATCCCAGGCACCCTGGAGTCCGGCTGCCCCC 
f|| AGAATCTGACCTGCTCTGTGCCCTGGGCCTGTGAGCAGGGGACACCCCCTATGATCTCCTGGATA 
hi GGGACCTCCGTGTCCCCCCTGGACCCCTCCACCACCCGCTCCTCGGTGCTCACCCTCATCCCACA 
If! GCCCCAGGACCATGGCACCAGCCTCACCTGTCAGGTGACCTTCCCTGGGGCCAGCGTGACCACGA 
¥ * ACAAGACCGTCCATCTCAACGTGTCCTACCCGCCTCAGAACTTGACCATGACTGTCTTCCAAGGA 

i; r*| 

GACGGCACAGTATCCACAGTCTTGGGAAATGGCTCATCTCTGTCACTCCCAGAGGGCCAGTCTCT 
p| GCGCCTGGTCTGTGCAGTTGATGCAGTTGACAGCAATCCCCCTGCCAGGCTGAGCCTGAGCTGGA 
|H GAGGCCTGACCCTGTGCCCCTCACAGCCCTCAAACCCGGGGGTGCTGGAGCTGCCTTGGGTGCAC 
C| C TGAGGGATGCAGC TGAAT T CACC TGCAGAGC TCAGAACCCT C TCGGCTCT CAGCAGGTCTACC T 

GAACGT CT CC CTGCAGAGCAAAGCCACATCAGGAGTGAC TCAGGGGGT GGTCGGGGGAGC TGGAG 
|n CCACAGCCCTGGTCTTCCTGTCCTTCTGCGTCATCTTCGTTGTAGTGAGGTCCTGCAGGAAGAAA 
TCGGCAAGGCCAGCAGCGGGCGTGGGAGATACGGGCATAGAGGATGCAAACGCTGTCAGGGGTTC 
AGCCTCTCAGGGGCCCCTGACTGAACCTTGGGCAGAAGACAGTCCCCCAGACCAGCCTCCCCCAG 
CTTCTGCCCGCTCCTCAGTGGGGGAAGGAGAGCTCCAGTATGCATCCCTCAGCTTCCAGATGGTG 
AAGCCTTGGGACTCGCGGGGACAGGAGGCCACTGACACCGAGTACTCGGAGATCAAGATCCACAG 
ATGA GAAACT GCAGAGACT CACCC TGAT TGAGGGATCACAGCCCCTCCAGGCAAGGGAGAAGTCA 
GAGGC T GATT CT TGTAGAAT TAACAGCCCTCAACGTGAT GAGCTATGATAACACTATGAATTATG 
TGCAGAGT GAAAAGCACACAGGC T TTAGAGT CAAAGTAT CT CAAACCTGAATCCACACTGTGCCC 
TCCCT T TTAT TT T T TTAACTAAAAGACAGACAAAT TCC TA 
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FIGURE 86 



MLLLLLPLLWGRERAEGQTSKLLTMQSSVTVQEGLCVHVPCSFSYPSHGWIYPGPVVHGYWFREG 
ANTDQDAPVATNNPARAWEETRDRFHLLGDPHTKNCTLS IRDARRSDAGRYFFRMEKGS IKWNY 
KHHRLSVNVTALTHRPNILIPGTLESGCPQNLTCSVPWACEQGTPPMISWIGTSVSPLDPSTTRS 
SVLTLIPQPQDHGTSLTCQVTFPGASVTTNKTVHLNVSYPPQNLTMTVFQGDGTVSTVLGNGSSL 
SLPEGQSLRLVCAVDAVDSNPPARLSLSWRGLTLCPSQPSNPGVLELPWVHLRDAAEFTCRAQNP 
LGSQQVYLNVSLQSKATSGVTQGWGGAGATALVFLSFCVIFVWRSCRKKSARPAAGVGDTGIE 
DANAVRGSASQGPLTEPWAEDSPPDQPPPASARSSVGEGELQYASLSFQMVKPWDSRGQEATDTE 
YSEIKIHR 

Signal peptide: 



PI amino acids 1-15 



Transmembrane domain: 

pi amino acids 351-370 

s n 
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FIGURE 87 



AGAAAGCT GCAC TCT GT TGAGCTCCAGGGCGCAG TGGAGGGAGGGAGTGAAGGAGC TC T C TGTAC 
CCAAGGAAAGTGCAGCT GAGAC TCAGACAAGAT TACA ATGA ACCAAC TCAGC T TCC TGC TGTT TC 
TCATAGCGACCACCAGAGGATGGAGTACAGATGAGGCTAATACT TAC T T CAAGGAATGGACCTGT 
TCTTCGTCTCCATCTCTGCCCAGAAGCTGCAAGGAAATCAAAGACGAATGTCCTAGTGCATTTGA 
TGGCCTGTATTTTCTCCGCACTGAGAATGGTGTTATCTACCAGACCTTCTGTGACATGACCTCTG 
GGGGTGGCGGCTGGACCCTGGTGGCCAGCGTGCATGAGAATGACATGCGTGGGAAGTGCACGGTG 
GGCGATCGCTGGTCCAGTCAGCAGGGCAGCAAAGCAGACTACCCAGAGGGGGACGGCAACTGGGC 
|3 CAACTACAACACC TT T GGATCTGCAGAGGCGGCCACGAGCGATGAC TACAAGAACCCTGGCTAC T 



ACGACATCCAGGCCAAGGACCTGGGCATCTGGCACGTGCCCAATAAGTCCCCCATGCAGCACTGG 
AGAAACAGC TCCC TGC TGAGGTACCGCACGGACAC TGGCT T CCTCCAGACACTGGGACATAAT CT 
GTTTGGCATCTACCAGAAATATCCAGTGAAATATGGAGAAGGAAAGTGTTGGACTGACAACGGCC 
|fl CGGTGATCCCTGTGGTCTATGATTTTGGCGACGCCCAGAAAACAGCATCTTATTACTCACCCTAT 
%U GGCCAGCGGGAAT TCACTGCGGGAT TTGT TCAGT TCAGGGTATT TAATAACGAGAGAGCAGCCAA 

* CGCCTTGTGTGCTGGAATGAGGGTCACCGGATGTAACACTGAGCATCACTGCATTGGTGGAGGAG 
!!f GATACT TT CCAGAGGCCAGTCCCCAGCAGTGTGGAGAT TT T TCT GGT T T T GAT TGGAGTGGATAT 

£ GGAACTCATGTTGGTTACAGCAGCAGCCGTGAGATAACTGAGGCAGCTGTGCTTCTATTCTATCG 
f|| TTGAGAGTTTTGTGGGAGGGAACCCAGACCTCTCCTCCCAACCATGAGATCCCAAGGATGGAGAA 
0 CAACTTACCCAGTAGCTAGAATGT TAATGGCAGAAGAGAAAACAATAAATCATAT T GACT CAAGA 

AAAAAA 
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FIGURE 88 



MNQLSFLLFLIATTRGWSTDEANTYFKEWTCSSSPSLPRSCKEIKDECPSAFDGLYFLRTENGVI 
YQTFCDMTSGGGGWTLVASVHENDMRGKCTVGDRWSSQQGSKADYPEGDGNWANYNTFGSAEAAT 
SDDYKNPGYYDIQAKDLGIWHVPNKSPMQHWRNSSLLRYRTDTGFLQTLGHNLFGIYQKYPVKYG 
EGKCWT DNG PVI PWYDFGDAQKTAS Y YS P YGQRE FTAGFVQ FRVFNNERAANALCAGMRVT GCN 
TEHHCIGGGGYFPEASPQQCGDFSGFDWSGYGTHVGYSSSREITEAAVLLFYR 



H Important features: 

Signal peptide: 

fp amino acids 1-16 



||j N-glycosylation site. 

amino acids 163-167 



Glycosaminoglycan attachment sites • 

amino acids 74-78, 289-293 

N-myristoylation sites, 

amino acids 76-82, 115-121, 124-130, 253-259, 292-298 
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FIGURE 8q 



CTAGATTTGTCGGCTTGCGGGGAGACTTCAGGAGTCGCTGTCTCTGAACTTCCAGCCTCAGAGAC 
CGCCGCCCTTGTCCCCGAGGGCCATGGGCCGGGTCTCAGGGCTTGTGCCCTCTCGCTTCCTGACG 
CTCCTGGCGCATCTGGTGGTCGTCATCACCTTATTCTGGTCCCGGGACAGCAACATACAGGCCTG 
CCTGCCTCTCACGTTCACCCCCGAGGAGTATGACAAGCAGGACATTCAGCTGGTGGCCGCGCTCT 
CTGTCACCCTGGGCCTCTTTGCAGTGGAGCTGGCCGGTTTCCTCTCAGGAGTCTCCATGTTCAAC 
AGCACCCAGAGCCTCATCTCCATTGGGGCTCACTGTAGTGCATCCGTGGCCCTGTCCTTCTTCAT 
ATTCGAGCGTTGGGAGTGCACTACGTATTGGTACATTTTTGTCTTCTGCAGTGCCCTTCCAGCTG 
TCACTG7\AATGGCTTTATTCGTCACCGTCTTTGGGCTGAAAAAGAAACCCTTCTGATTACCTTCA 
fit TGACGGGAACC TAAGGACGAAGCC TACAGGGGCAAGGGCCGCTT CGTAT TCCT GGAAGAAGGAAG 

|J GCATAGGCTTCGGTTTTCCCCTCGGAAACTGCTTCTGCTGGAGGATATGTGTTGGAATAATTACG 
ilj TCTTGAGTCTGGGATTATCCGCATTGTATTTAGTGCTTTGTAATAAAATATGTTTTGTAGTAACA 
%! • TTAAGACTTATATACAGTTTTAGGGGACAAT TAAAAAAAAAAAA 
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FIGURE qo 

MGRVSGLVPSRFLTLLAHLVWI TLFWSRDSNI QACLPLT FT PEE YDKQDI QLVAALSVTLGLFA 
VELAGFLSGVSMFNS TQSLI S IGAHCSASVALS FFI FERWECTTYWYI FVFCSALPAVTEMALFV 
TVFGLKKKPF 

Transmembrane domain: 

amino acids 12-28 (type II), 51-66, 107-124 
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FIGURE Ql 



C TGGGACCCCGAAAAGAGAAGGGGAGAGCGAGGGGAC GAGAGCGGAGGAGGAAGATGCAACT GAC 
TCGCTGCTGCTTCGTGTTCCTGGTGCAGGGTAGCCTCTATCTGGTCATCTGTGGCCAGGATGATG 
GTCCTCCCGGCTCAGAGGACCCTGAGCGTGATGACCACGAGGGCCAGCCCCGGCCCCGGGTGCCT 
CGGAAGCGGGGCCACATCTCACCTAAGTCCCGCCCCATGGCCAATTCCACTCTCCTAGGGCTGCT 
GGCCCCGCCTGGGGAGGCTTGGGGCATTCTTGGGCAGCCCCCCAACCGCCCGAACCACAGCCCCC 
CACCCTCAGCCAAGGT GAAGAAAATC TT TGGCTGGGGCGAC T TCTAC TCCAACAT CAAGACGGTG 
GCCCTGAACCTGCTCGTCACAGGGAAGATTGTGGACCATGGCAATGGGACCTTCAGCGTCCACTT 
JJJ C CAAC ACAAT GCCACAG GCCAGGGAAACAT C TC CATC AGCC T C G T GC C C CC CAG T AAAGC T G TAG 

*w% AGT TCCACCAGGAACAGCAGATCT TCAT CGAAGCCAAGGCCT CCAAAAT C T TCAACTGCCGGATG 

hj GAGTGGGAGAAGGTAGAACGGGGCCGCCGGACCT CGC T T TGCACCCACGACCCAGCCAAGAT CT G 

111 CTCCCGAGACCACGCTCAGAGCTCAGCCACCTGGAGCTGCTCCCAGCCCTTCAAAGTCGTCTGTG 
TC TACATCGCC T TCTACAGCACGGACTATCGGC TGGT CCAGAAGGTGT GCCCAGAT TACAAC TAC 
CATAGTGATACCCCCTACTACCCATCTGGGTGACCCGGGGCAGGCCACAGAGGCCAGGCCAGGGC 
|*| TGGAAGGACAGGCCTGCCCATGCAGGAGACCATCTGGACACCGGGCAGGGAAGGGGTTGGGCCTC 
If! AGGCAGGGAGGGGGGT GGAGACGAGGAGAT GCCAAGT GGGGCCAGGGCCAAGT CT CAAGT GGCAG 

Q AGAAAGGGTCCCAAGT GCT GGTCCCAACCT GAAGC TGTGGAGTGAC TAGAT CACAGGAGCACT GG 

JJf AGGAGGAGTGGGCTCTCTGTGCAGCCTCACAGGGCTTTGCCACGGAGCCACAGAGAGATGCTGGG 
5"l TCCCCGAGGCCTGTGGGCAGGCCGATCAGTGTGGCCCCAGATCAAGTCATGGGAGGAAGCTAAGC 
CC TTGGTT CT TGCCAT CCTGAGGAAAGATAGCAACAGGGAGGGGGAGAT TT CATCAGTGTGGACA 
GCCTGTCAACTTAGGATGGATGGCTGAGAGGGCTTCCTAGGAGCCAGTCAGCAGGGTGGGGTGGG 
GCCAGAGGAGCTCTCCAGCCCTGCCTAGTGGGCGCCCTGAGCCCCTTGTCGTGTGCTGAGCATGG 
CATGAGGCTGAAGTGGCAACCCTGGGGTCTTTGATGTCTTGACAGATTGACCATCTGTCTCCAGC 
CAGGCCACCCCTTTCCAAAATTCCCTCTTCTGCCAGTACTCCCCCTGTACCACCCATTGCTGATG 
GCACACCCATCCTTAAGCTAAGACAGGACGATTGTGGTCCTCCCACACTAAGGCCACAGCCCATC 
CGCGTGCTGTGTGTCCCTCTTCCACCCCAACCCCTGCTGGCTCCTCTGGGAGCATCCATGTCCCG 
GAGAGGGGTCCCTCAACAGTCAGCCTCACCTGTCAGACCGGGGTTCTCCCGGATCTGGATGGCGC 
CGCCCTCTCAGCAGCGGGCACGGGTGGGGCGGGGCCGGGCCGCAGAGCATGTGCTGGATCTGTTC 
TGTGTGTCTGTCTGTGGGTGGGGGGAGGGGAGGGAAGTCTTGTGAAACCGCTGATTGCTGACTTT 
TGTGTGAAGAATCGTGTTCTTGGAGCAGGAAATAAAGCTTGCCCCGGGGCA 
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FIGURE Q2 

MQLTRCCFVFLVQGSLYLVICGQDDGPPGSEDPERDDHEGQPRPRVPRKRGHISPKSRPMANSUL 
LGLLAPPGEAWGILGQPPNRPNHSPPPSAKVKKIFGWGDFYSNIKTVALNLLVTGKIVDHGNGTF 
SVHFQHNATGQGNI S I SLVPPSKAVE FHQEQQI FIEAKASKI FNCRMEWEKVERGRRTSLCTHDP 
AKI CSRDHAQS SATWSCSQP FKWCVY I AFYS TDYRLVQKVCPDYNYHS DT PYYPS G 

Important features of the protein : 
Signal peptide: 

amino acids 1-14 

N-glycosylation sites. 

amino acids 62-65, 127-130, 137-140, 143-146 

2-oxo acid dehydrogenases acyl transferase 

amino acids 61-71 
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FIGURE Qft 

CGGTGGCC ATGA CTGCGGCCGTGTTCTTCGGCTGCGCCTTCATTGCCTTCGGGCCTGCGCTCGCC 
CTTTATGTCTTCACCATCGCCATCGAGCCGTTGCGTATCATCTTCCTCATCGCCGGAGCTTTCTT 
CTGGTTGGTGTCTCTACTGATTTCGTCCCTTGTTTGGTTCATGGCAAGAGTCATTATTGACAACA 
AAGAT GGACCAACACAGAAATATCTGC T GATC TT TGGAGCGT T T GT CT C TGT CTATATCCAAGAA 
AT GT T CCGAT TTGCATATTATAAACT CT TAAAAAAAGCCAGTGAAGGT T TGAAGAGTATAAACCC 
AGGTGAGACAGCACCCTCTATGCGACTGCTGGCCTATGTTTCTGGCTTGGGCTTTGGAATCATGA 
GTGGAGTATTTTCCTTTGTGAATACCCTATCTGACTCCTTGGGGCCAGGCACAGTGGGCATTCAT 
GGAGATTCTCCTCAATTCTTCCTTTATTCAGCTTTCATGACGCTGGTCATTATCTTGCTGCATGT 
ATTCTGGGGCATTGTATTTTTTGATGGCTGTGAGAAGAAAAAGTGGGGCATCCTCCTTATCGTTC 
TCCTGACCCACCTGCTGGTGTCAGCCCAGACCTTCATAAGTTCTTATTATGGAATAAACCTGGCG 
TCAGCATTTATAATCCTGGTGCTCATGGGCACCTGGGCATTCTTAGCTGCGGGAGGCAGCTGCCG 
AAGCCTGAAACTCTGCCTGCTCTGCCAAGACAAGAACTTTCTTCTTTACAACCAGCGCTCCAGAT 
AACCTCAGGGAACCAGCACTTCCCAAACCGCAGACTACATCTTTAGAGGAAGCACAACTGTGCCT 
TTTTCTGAAAATCCCTTTTTCTGGTGGAATTGAGAAAGAAATAAAACTATGCAGATA 
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FIGURE qa 

MTAAVFFGCAFIAFGPALALYVFTIAIEPLRI IFLIAGAFFWLVSLLISSLWFMARVI IDN 
PTQKYLLIFGAFVSVYIQEMFRFAYYKLLKKASEGLKSINPGETAPSMRLLAYVSGLGFGIMSGV 
FSFVNTLSDSLGPGTVGIHGDSPQFFLYSAFMTLVIILLHVFWGIVFFDGCEKKKWGILLIVLLT 
HLLVSAQTFISSYYGINLASAFIILVLMGTWAFLAAGGSCRSLKLCLLCQDKNFLLYNQRSR 

Important features of the protein: 
Signal peptide: 

amino acids 1-19 

Transmembrane domains: 

amino acids 32-51, 119-138, 152-169, 216-235 

Glycosaminoglycan attachment site. 

amino acids 120-123 

Sodium: neurotransmitter symporter family protein 

amino acids 31-65 
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FIGURE 

AATT TT TCACCAGAGTAAAC TTGAGAAACCAAC TGGACC T TGAGTAT T GTACATT T TGCC TCGTG 
GACCCAAAGGTAGCAATCTGAAACATGAGGAGTACGAT TC TACTGT T T T GT CT TC TAGGATCAAC 
TCGGTCATTACCACAGCTCAAACCTGCTTTGGGACTCCCTCCCACAAAACTGGCTCCGGATCAGG 
GAACACTACCAAACCAACAGCAGTCAAATCAGGTCTTTCCT TCTT TAAGTCTGATACCATTAACA 
CAGAT GCTCACAC TGGGGCCAGAT CT GCATCTGT TAAATCC TGCT GCAGGAATGACACC TGGTAC 
C CAGACCCACCCATTGACCCTGGGAGGGT T GAAT GTACAACAGCAACT GCACCCACAT GTGT TAC 
M CAATTTTTGTCACACAACTTGGAGCCCAGGGCACTATCCTAAGCTCAGAGGAATTGCCACAAATC 

0 TTCACGAGCCTCATCATCCATTCCTTGTTCCCGGGAGGCATCCTGCCGACCAGTCAGGCAGGGGC 
I* T AAT C CAGAT GT C CAGGAT GGAAG CC T T C CAGCAGGAGGAG CAGG T G T AAAT CC T G CCACCCAGG 

II GAACCCCAGCAGGCCGCCTCCCAACTCCCAGTGGCACAGATGACGACTTTGCAGTGACCACCCCT 

*U 

I II G CAGG CAT C CAAAGGAGCACACAT GC CAT C GAGGAAGC CAC C ACAGAAT CAGCAAATGGAAT T C A 

01 GTAAGCTGTTTCAAATTTTTTCAACTAAGCTGCCTCGAATTTGGTGATACATGTGAATCTTTATC 

ss. 

W ATTGATTATATTATGGAATAGATTGAGACACATTGGATAGTCTTAGAAGAAATTAATTCTTAATT 

* T ACCT GAAAATAT TCT TGAAATT TCAGAAAATATGT TC TATGTAGAGAATCCCAACT T T TAAAAA 

p| CAATAATT CAAT GGATAAATC TGTCT T TGAAATATAACAT TATGC T GCC TGGATGATATGCATAT 

fl T AAAACAT AT TT GGAAAACTGGAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 

ill AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 

CI 

m • 
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FIGURE q6 

MRSTILLFCLLGSTRSLPQLKPALGLPPTKIiAPDQGTLPNQQQSNQVFPSLSLIPLTQM 
LTLGPDLHLLNPAAGMTPGTQTHPLTLGGLNVQQQLHPHVLPIFVTQLGAQGTILSSEE 
LPQIFTSLIIHSLFPGGILPTSQAGANPDVQDGSLPAGGAGVNPATQGTPAGRLPTPSG 
TDDD FAVT T PAG I QRS THAIEEAT TESANTG I Q 



Signal peptide: 

amino acids 1-16 



W 
If! 



III 

0 

ri! 
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FIGURE Q7 



GCTCAAGTGCCCTGCCTTGCCCCACCCAGCCCAGCCTGGCCAGAGCCCCCTGGAGAAGGAGCTCT 
CTTCTTGCTTGGCAGCTGGACCAAGGGAGCCAGTCTTGGGCGCTGGAGGGCCTGTCCTGACCATG 
GTCCCTGCCTGGCTGTGGCTGCTTTGTGTCTCCGTCCCCCAGGCTCTCCCCAAGGCCCAGCCTGC 
AGAGCTGTCTGTGGAAGTTCCAGAAAACTATGGTGGAAATTTCCCTTTATACCTGACCAAGTTGC 
CGCTGCCCCGTGAGGGGGCTGAAGGCCAGATCGTGCTGTCAGGGGACTCAGGCAAGGCAACTGAG 
GGCCCATTTGCTATGGATCCAGATTCTGGCTTCCTGCTGGTGACCAGGGCCCTGGACCGAGAGGA 
GCAGGCAGAGTACCAGC TACAGGT CACCCT GGAGATGCAGGAT GGACATG TC TTGTGGGGTCCAC 
AGCCTGTGCTTGTGCACGTGAAGGATGAGAATGACCAGGTGCCCCATTTCTCTCAAGCCATCTAC 
AGAGCTCGGCTGAGCCGGGGTACCAGGCCTGGCATCCCCTTCCTCTTCCTTGAGGCTTCAGACCG 
hi GGATGAGCCAGGCACAGCCAACTCGGATCTTCGATTCCACATCCTGAGCCAGGCTCCAGCCCAGC 
m CTTCCCCAGACATGTTCCAGCTGGAGCCTCGGCTGGGGGCTCTGGCCCTCAGCCCCAAGGGGAGC 
lit ACCAGCCTTGACCACGCCCTGGAGAGGACCTACCAGCTGTTGGTACAGGTCAAGGACATGGGTGA 
y CCAGGCCTCAGGCCACCAGGCCACTGCCACCGTGGAAGTCTCCATCATAGAGAGCACCTGGGTGT 
Im CCCTAGAGCCTATCCACCTGGCAGAGAATCTCAAAGTCCTATACCCGCACCACM 
Ul CACTGGAGTGGGGGTGATGTGCACTATCACCTGGAGAGCCATCCCCCGGGACCCTTTGAAGTGAA 
||! T GCAGAGGGAAACC TC TACGTGACCAGAGAGC TGGACAGAGAAGCCCAGGCTGAGTACC TGC TCC 

m AGGTGCGGGCTCAGAATTCCCATGGCGAGGACTATGCGGCCCCTCTGGAGCTGCACGTGCTGGTG 
ATGGATGAGAATGACAACGTGCCTATCTGCCCTCCCCGTGACCCCACAGTCAGCATCCCTGAGCT 
%l CAGTCCACGAGGTACTGAAGTGACTAGACTGTCAGCAGAGGATGCAGATGCCCCCGGCTCCCCCA 
* ATTCCCACGTTGTGTATCAGCTCCTGAGCCCTGAGCCTGAGGATGGGGTAGAGGGGAGAGCCTTC 
Q CAGGTGGACCCCACTTCAGGCAGTGTGACGCTGGGGGTGCTCCCACTCCGAGCAGGCCAGAACAT 
8^ CCTGCTTCTGGTGCTGGCCATGGACCTGGCAGGCGCAGAGGGTGGCTTCAGCAGCACGTGTGAAG 
Ji* TCGAAGTCGCAGTCACAGATATCAATGATCACGCCCCTGAGTTCATCACTTCCCAGATTGGGCCT 
14 ATAAGCCTCCCTGAGGATGTGGAGCCCGGGACTCTGGTGGCCATGCTAACAGCCATTGATGCTGA 
HI CCTCGAGCCCGCCTTCCGCCTCATGGATTTTGCCATTGAGAGGGGAGACACAGAAGGGACTTTTG 
Q GCCTGGAT T GGGAGCCAGACTCT GGGCATGT TAGAC T CAGAC T CTGCAAGAACC TCAGT TATGAG 

GCAGCTCCAAGTCATGAGGTGGTGGTGGTGGTGCAGAGTGTGGCGAAGCTGGTGGGGCCAGGCCC 
8 U AGGCCCTGGAGCCACCGCCACGGTGACTGTGCTAGTGGAGAGAGTGATGCCACCCCCCAAGTTGG 
ACCAGGAGAGCTACGAGGCCAGTGTCCCCATCAGTGCCCCAGCCGGCTCTTTCCTGCTGACCATC 
CAGCCCTCCGACCCCATCAGCCGAACCCTCAGGTTCTCCCTAGTCAATGACTCAGAGGGCTGGCT 
CTGCATTGAGAAATTCTCCGGGGAGGTGCACACCGCCCAGTCCCTGCAGGGCGCCCAGCCTGGGG 
ACACCTACACGGTGCTTGTGGAGGCCCAGGATACAGCCCTGACTCTTGCCCCTGTGCCCTCCCAA 
TACCTCTGCACACGCCGCCAAGACCATGGCTTGATCGTGAGTGGACCCAGCAAGGACCCCGATCT 
GGCCAGTGGGCACGGTCCCTACAGCTTCACCCTTGGTCCCAACCCCACGGTGCAACGGGATTGGC 
GCCTCCAGACTCTCAATGGTTCCCATGCCTACCTCACCTTGGCCCTGCATTGGGTGGAGCCACGT 
GAACACATAATCCCCGTGGTGGTCAGCCACAATGCCCAGATGTGGCAGCTCCTGGTTCGAGTGAT 
CGTGTGTCGCTGCAACGTGGAGGGGCAGTGCATGCGCAAGGTGGGCCGCATGAAGGGCATGCCCA 
CGAAGCTGTCGGCAGTGGGCATCCTTGTAGGCACCCTGGTAGCAATAGGAATCTTCCTCATCCTC 
AT T T TCACCCACTGGACCAT GTCAAGGAAGAAGGACCCGGATCAACCAGCAGACAGCGTGCCCCT 
GAAGGCGACTGTCTGAATGGCCCAGGCAGCTCTAGCTGGGAGCTTGGCCTCTGGCTCCATCTGAG 
TCCCCT GGGAGAGAGCCCAGCACCCAAGAT CCAGCAGGGGACAGGACAGAGTAGAAGCCCC TCCA 
TC TGC C C TGGGG TGGAGGCACCATCACCAT CACCAGGCATGT CT GCAGAGCC TGGACAC CAAC T T 
TATGGACTGCCCATGGGAGTGCTCCAAATGTCAGGGTGTTTGCCCAATAATAAAGCCCCAGAGAA 
CTGGGCTGGGCCCTATGGGAAAAAAAAAAAA?^AAAAAAAAAAAAAAAAAAAG 
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FIGURE q8 

MVPAWLWLLCVSVPQALPKAQPAELSVEVPENYGGNFPLYLTKLPLPREGAEGQIVLSGDSGKAT 
EGPFAMDPDSGFLLVTRALDREEQMYQLQVTLEMQDGHVLWGPQPVLVHVKDENDQVPHFSQAI 
YRARLSRGTRPGIPFLFLEASDRDEPGTANSDLRFHILSQAPAQPSPDMFQLEPRLGALALSPKG 
STSLDHALERTYQLLVQWDMGDQASGHQATATVEVSIIESTWSLEPIHIAENLKVLYPHHMAQ 
VHWSGGDVHYHLESHPPGPFEVNAEGNLYVTRELDREAQAEYLLQVRAQNSHGEDYAAPLELHVL 
VMDENDNVPICPPRDPTVSIPELSPPGTEVTRLSAEDADAPGSPNSHWYQLLSPEPEDGVEGRA 
FQVDPTSGSVTLGVLPLRAGQNILLLVLAMDLAGAEGGFSSTCEVEVAVTDINDHAPEFITSQIG 
PISLPEDVEPGTLVAMLTAIDADLEPAFRLMDFAIERGDTEGTFGLDWEPDSGHVRLRLCKNLSY 
EAAP SHEVWWQS VAKLVGPGPGPGATATVT VLVERVMP P PKLDQES YE AS VP I S APAGS FLLT 
IQPSDPISRTLRFSLVNDSEGWLCIEKFSGEVHTAQSLQGAQPGDTYTVLVEAQDTALTLAPVPS 
QYLCTPRQDHGLIVSGPSKDPDLASGHGPYSFTLGPNPTVQRDWRLQTLNGSHAYLTLALHWVEP 
RE H 1 1 PVWS HNAQMWQLL VRVI VCRCNVE G QCMRKVGRMKGMP T KLS AVG I L VG T L VAI G I FLI 
LI FTHWTMSRKKDPDQPADSVPLKATV 

Signal peptide: 

amino acids 1-18 

Transmembrane domain: 

amino acids 762-784 
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FIGURE QQ 

GGCT GACCGTGC TACATTGCCTGGAGGAAGCCTAAGGAACCCAGGCATCCAGCTGCCCACGCC TG 
AGTCCAAGAT TC T TCCCAGGAACACAAACGTAGGAGACCCACGCTCCT GGAAGCACCAGCCT T TA 
TCTCTTCACCTTCAAGTCCCCTTTCTCAAGAATCCTCTGTTCTTTGCCCTCTAAAGTCTTGGTAC 
AT CTAGGACCCAGGCATCT T GCTT TCCAGCCACAAAGAGACAGATGAAGAT GCAGAAAGGAAATG 
T TCT CCT TATGT T TGGT C TACT AT TGCAT TT AGAAGC T GCAACAAAT T CCAATGAGACTAGCACC 
TCTGCCAACACTGGATCCAGTGTGATCTCCAGTGGAGCCAGCACAGCCACCAACTCTGGGTCCAG 
TGTGACCTCCAGTGGGGTCAGCACAGCCACCATCTCAGGGTCCAGCGTGACCTCCAATGGGGTCA 
GCATAGTCACCAACTCTGAGTTCCATACAACCTCCAGTGGGATCAGCACAGCCACCAACTCTGAG 
TT CAGCACAGCGTCCAGTGGGAT CAGCATAGCCAC CAACT CT GAGTCCAGCACAACC TCCAGT GG 
GGCCAGCACAGCCACCAACTCT GAGT CCAGCACACCC TCCAGTGGGGCCAGCACAGT CACCAACT 
CTGGGTCCAGTGTGACCTCCAGTGGAGCCAGCACTGCCACCAACTCTGAGTCCAGCACAGTGTCC 
AGTAGGGCCAGCACTGCCACCAACTCTGAGTCTAGCACACTCTCCAGTGGGGCCAGCACAGCCAC 
CAACTCTGACTCCAGCACAACCT CCAGTGGGGC TAGCAGAGCCACCAACT CT GAGTCCAGCACAA 
CCT CCAGT GGGGCCAGCACAGCCAC CAACT CTGAGTCCAGCACAGT GT CCAGTAGGGCCAGCACT 
GCCAC CAACT CT GAGTCCAGCACAACCT CCAGTGGGGCCAGCACAGCCACCAACTC TGAGTCCAG 
AACGACCTCCAATGGGGCTGGCACAGCCACCAACTCTGAGTCCAGCACGACCTCCAGTGGGGCCA 
GCACAGCCACCAACTC TGAC TCCAGCACAGT GTCCAGT GGGGCCAGCACTGCCACCAAC TCT GAG 
TCCAGCACGACCTCCAGTGGGGCCAGCACAGCCACCAACTCTGAGTCCAGCACGACCTCCAGTGG 
GGCTAGCACAGCCACCAACT CT GACTCCAGCACAACC T CCAGTGGGGCCGGCACAGC CACCAACT 
CT GAGT CCAGCACAGTGTCCAGT GGGATCAGCACAGTCACCAATT CT GAGTCCAGCACACCC TCC 
AGT GGGGCCAACACAGCCAC CAACT CTGAGT CCAGTACGACC TCCAGT GGGGCCAACACAGCCAC 
CAAC TCT GAGTCCAGCACAGTGT CCAGT GGGGCCAGCACTGCCACCAACT CTGAGT CCAGCACAA 
CCTCCAGTGGGGTCAGCACAGCCACCAACTCTGAGTCCAGCACAACCTCCAGTGGGGCTAGCACA 
GCCAC CAAC T CT GACTCCAGCACAACCT CCAGTGAGGCCAGCACAGCCAC CAAC TCT GAGTC TAG 
CACAGT GTCCAGTGGGAT CAGCACAGTCACCAAT TC TGAGTCCAGCACAACCTCCAGTGGGGCCA 
ACACAGCCACCAACTCTGGGTCCAGTGTGACCTCTGCAGGCTCTGGAACAGCAGCTCTGACTGGA 
ATGCACACAACTTCCCATAGTGCATCTACTGCAGTGAGTGAGGCAAAGCCTGGTGGGTCCCTGGT 
GCCGTGGGAAATCTTCCTCATCACCCTGGTCTCGGTTGTGGCGGCCGTGGGGCTCTTTGCTGGGC 
TCTTCTTCTGTGTGAGAAACAGCCTGTCCCTGAGAAACACCTTTAACACAGCTGTCTACCACCCT 
CATGGCCTCAACCATGGCCTTGGTCCAGGCCCTGGAGGGAATCATGGAGCCCCCCACAGGCCCAG 
GT GGAGTCCT AAC TGGT TCTGGAGGAGACCAG TAT CATCGATAGCCAT GGAGAT GAGCGGGAGGA 
ACAGCGGGCCCTGAGCAGCCCCGGAAGCAAGTGCCGCATTCTTCAGGAAGGAAGAGACCTGGGCA 
CCCAAGACCTGGTTTCCTTTCATTCATCCCAGGAGACCCCTCCCAGCTTTGTTTGAGATCCTGAA 
AAT C T TGAAGAAGGTAT TCC TCACCT T T CTT GCC T T TACCAGACACT GGAAAGAGAATAC TAT AT 
T GCTCAT TTAGC TAAGAAATAAATACAT CTCATCTAACACACACGACAAAGAGAAGC TGTGCT T G 
CCCCGGGGTGGGTATCTAGCTCTGAGATGAACTCAGTTATAGGAGAAAACCTCCATGCTGGACTC 
CATCTGGCAT TCAAAATCTCCACAGTAAAAT CCAAAGACCTCAAAAAAAAAAAAAAAAAAAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE loo 



!?■'»'■ ! i! 



MKMQKGNVLLMFGLLLHLEAATNSNET ST SANTGS S VI S SGAS TATNSGS SVTS S GVS TAT I S GS 
SVTSNGVSIVTNSEFHTTSSGISTATNSEFSTASSGISIATNSESSTTSSGASTATNSESSTPSS 
GAS TVTNS GS S VT S S GAS TATNSE S S TVSSRAS TATNS ES S T LS S GAS TATNS DS S TT S S GAS TA 
TNSESSTTSSGASTATNSESSTVSSRASTATNSESSTTSSGASTATNSESRTTSNGAGTATNSES 
STTSSGASTATNSDSSTVS SGAS TATNSESSTTS SGAS TATNSESSTTSSGASTATNSDSSTTSS 
GAGTATNSESSTVSSGISTVTNSESSTPSSGANTATNSESSTTSSGANTATNSESSTVSSGASTA 
TNSESSTTSSGVSTATNSESSTTSSGASTATNSDSSTTSSEASTATNSESSTVSSGISTVTNSES 
STTSSGANTATNSGSSVTSAGSGTAALTGMHTTSHSASTAVSEAKPGGSLVPWEIFLITLVSWA 
AVGLFAGLFFCVRNS LS LRNT FNTAVYHPHGLNHGLGPGPGGNHGAPHRPRWS PNWFWRRPVS S I 
AMEMSGRNSGP 

Signal peptide: 

amino acids 1-20 



IPS Transmembrane domain: 

W 

Hi 



amino acids 510-532 
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FIGURE 101 



GGCCGGACGCCTCCGCGTTACGGGATGAATTAACGGCGGGTTCCGCACGGAGGTTGTGACCCCTA 
CGGAGCCCCAGCTTGCCCACGCACCCCACTCGGCGTCGCGCGGCGTGCCCTGCTTGTCACAGGTG 
GGAGGCTGGAACTATCAGGCTGAAAAACAGAGTGGGTACTCTCTTCTGGGAAGCTGGCAACAAAT 
GGAT GATGT GATATATGCAT TCCAGGGGAAGGGAAAT TGTGGTGCT TCT GAACCCATGGT CAAT T 
AACGAGGCAGTT TCTAGCTACTGCACGTACTTCATAAAGCAGGACTCTAAAAGCT TTGGAATCAT 
GGT GT CAT GGAAAGGGATT TACT T TAT ACT GACTC TGT T TT GGGGAAGCT T T T TTGGAAGCAT TT 
TCATGCTGAGTCCCTTTTTACCTTTGATGTTTGTAAACCCATCTTGGTATCGCTGGATCAACAAC 
CGCCTTGTGGCAACATGGCTCACCCTACCTGTGGCATTATTGGAGACCATGTTTGGTGTAAAAGT 
GAT TATAACT GGGGATGCAT TTGT T CCT GGAGAAAGAAGTGTCAT TAT CATGAACCATCGGACAA 
GAATGGACTGGATGTTCCTGTGGAAT TGCCTGATGCGATATAGCTACCTCAGATTGGAGAAAAT T 
TGCCTCAAAGCGAGTCTCAAAGGTGTTCCTGGATTTGGTTGGGCCATGCAGGCTGCTGCCTATAT 
CT T CAT T CATAGGAAAT GGAAGGAT GACAAGAGCCATT TCGAAGACAT GATT GAT TACT T T TGTG 
ATATTCACGAACCACTTCAACTCCTCATATTCCCAGAAGGGACTGATCTCACAGAAAACAGCAAG 
TCT CGAAGTAATGCAT T TGC TGAAAAAAAT GGAC T T CAGAAATAT GAATATGT T TTACAT CCAAG 
AACTACAGGCTTTACTTTTGTGGTAGACCGTCTAAGAGAAGGTAAGAACCTTGATGCTGTCCATG 
ATATCACTGTGGCGTATCCTCACAACATTCCTCAATCAGAGAAGCACCTCCTCCAAGGAGACTTT 
CCCAGGGAAATCCACTTTCACGTCCACCGGTATCCAATAGACACCCTCCCCACATCCAAGGAGGA 
CCTTCAACTCTGGTGCCACAAACGGTGGGAAGAGAAAGAAGAGAGGCTGCGTTCCTTCTATCAAG 
GGGAGAAGAATTTTTATTTTACCGGACAGAGTGTCATTCCACCTTGCAAGTCTGAACTCAGGGTC 
CTTGTGGTCAAATTGCTCTCTATACTGTATTGGACCCTGTTCAGCCCTGCAATGTGCCTACTCAT 
AT AT TT GTACAGTCT TGT TAAGTGGT AT TT TATAATCACCAT T GTAATC T T T GTGCTGCAAGAGA 
GAATATTTGGTGGACTGGAGATCATAGAACT TGCATGTTACCGACTT TTACACAAACAGCCACAT 
TTAAATTCAAAGAAAAATGAGTAAGATTATAAGGT TTGCCATGTGAAAACCTAGAGCATATTTT G 
GAAAT GT TCTAAACCTT TCTAAGCT CAGATGCAT T T TT GCAT GACT ATGT CGAATAT T TC T TACT 
GCCATCATTAT TTGTTAAAGATATT TT GCACT TAAT TT T GTGGGAAAAATAT TGCTACAAT T TT T 
T TT AAT CTCT GAAT GTAATT TCGATAC T GTGTACATAGCAGGGAGTGATCGGGGTGAAATAAC T T 
GGGCCAGAATATTAT TAAACAATCATCAGGCT TTTAAA 



FIGURE 102 



MHSRGREIWLLNPWSINEAVSSYCTYFIKQDSKSFGIMVSWKGIYFILTLFWGSFFGSIFMLSP 
FLPLMFVNPSWYRWINNRLVATWLTLPVALLETMFGVKVI I TGDAFVPGERS VI IMNHRTRMDWM 
FLWNCLMRYSYLRLEKICLKASLKGVPGFGWAMQAAAYIFIHRKWKDDKSHFEDMIDYFCDIHEP 
LQLLIFPEGTDLTENSKSRSNAFAEKNGLQKYEYVLHPRTTGFTFWDRLREGKNLDAVHDITVA 
YPHNIPQSEKHLLQGDFPREIHFHVHRYPIDTLPTSKEDLQLWCHKRWEEKEERLRSFYQGEKNF 
YFTGQSVI PPCKSELRVLWKLLS I LYWTLFSPAMCLLI YLYSLVKWYFI I T IVI FVLQERI FGG 
LEI IELACYRLLHKQPHLNSKKNE 

Important features of the protein: 
Signal peptide: 

amino acids 1-22 

Transmembrane domains: 

amino acids 44-63, 90-108, ■ 354-377 
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FIGURE 103 



CGGCTCGAGCGGCTCGAGTGAAGAGCCTCTCCACGGCTCCTGCGCCTGAGACAGCTGGCCTGACC 
T CCAAAT CATCCATCCACCCCT GCTGTCAT CTGT T TT CATAGTGT GAGATCAACCCACAGGAAT A 
TCCATGGCTTTTGTGCTCATTTTGGTTCTCAGTTTCTACGAGCTGGTGTCAGGACAGTGGCAAGT 
CACTGGACCGGGCAAGTTTGTCCAGGCCTTGGTGGGGGAGGACGCCGTGTTCTCCTGCTCCCTCT 
TTCCTGAGACCAGTGCAGAGGCTATGGAAGTGCGGTTCTTCAGGAATCAGTTCCATGCTGTGGTC 
CACC T CTACAGAGATGGGGAAGACTGGGAAT CTAAGCAGATGCCACAGTAT CGAGGGAGAAC TGA 
GTTTGTGAAGGACTCCATTGCAGGGGGGCGTGTCTCTCTAAGGCTAAAAAACATCACTCCCTCGG 
ACATCGGCCTGTATGGGTGCTGGTTCAGTTCCCAGATTTACGATGAGGAGGCCACCTGGGAGCTG 
CGGGTGGCAGCACTGGGCTCACTTCCTCTCATTTCCATCGTGGGATATGTTGACGGAGGTATCCA 
H GTTACTCTGCCTGTCCTCAGGCTGGTTCCCCCAGCCCACAGCCAAGTGGAAAGGTCCACAAGGAC 
Q AGGAT T TGT C T TCAGAC TCCAGAGCAAATGCAGATGGGTACAGCCT GTATGATGTGGAGATC TCC 

Q ATTATAGTCCAGGAAAATGCTGGGAGCATATTGTGTTCCATCCACCTTGCTGAGCAGAGTCATGA 
#f* GGTGGAATCCAAGGTATTGATAGGAGAGACGTTTTTCCAGCCCTCACCTTGGCGCCTGGCTTCTA 
TTTTACTCGGGTTACTCTGTGGTGCCCTGTGTGGTGTTGTCATGGGGATGATAATTGTTTTCTTC 
AAATCCAAAGGGAAAATCCAGGCGGAACTGGACTGGAGAAGAAAGCACGGACAGGCAGAATTGAG 
If? AGACGCCCGGAAACACGCAGTGGAGGTGACT CTGGAT CCAGAGACGGCT CACCCGAAGC TC TGCG 

ffl T T TCT GAT CT GAAAAC TGTAACCCAT AGAAAAGCTCCCCAGGAGGT GCCTCAC TC TGAGAAGAGA 

uf% TTTACAAGGAAGAGTGTGGTGGCT TCTCAGGGTTTCCAAGCAGGGAGACAT TACTGGGAGGTGGA 

l"' CGTGGGACAAAATGTAGGGTGGTATGTGGGAGTGTGTCGGGATGACGTAGACAGGGGGAAGAACA 
1 ? ATGTGACTTTGTCTCCCAACAATGGGTATTGGGTCCTCAGACTGACAACAGAACATTTGTATTTC 
CI ACATTCAATCCCCATTTTATCAGCCTCCCCCCCAGCACCCCTCCTACACGAGTAGGGGTCTTCCT 
|f| GGACTATGAGGGTGGGACCATCTCCTTCTTCAATACAAATGACCAGTCCCTTATTTATACCCTGC 
PI T GACAT GTCAGTT TGAAGGCT TGT TGAGACCC TATATCCAGCATGCGAT GTATGACGAGGAAAAG 

J?? GGGACTCCCATATTCATATGTCCAGTGTCCTGGGGATGAGACAGAGAAGACCCTGCTTAAAGGGC 
CCCACACCACAGACCCAGACACAGCCAAGGGAGAGTGCTCCCGACAGGTGGCCCCAGCTTCCTCT 
13 CCGGAGCCTGCGCACAGAGAGTCACGCCCCCCACTCTCCTTTAGGGAGCTGAGGTTCTTCTGCCC 
HI TGAGCCCTGCAGCAGCGGCAGTCACAGCTTCCAGATGAGGGGGGATTGGCCTGACCCTGTGGGAG 
TCAGAAGCCATGGCTGCCCTGAAGTGGGGACGGAATAGACTCACAT TAGGTT TAGTT TGTGAAAA 
CT CCATCCAGCTAAGCGATC T TGAACAAGT CACAACC TCCCAGGC T CCTCAT T TGCTAGTCACGG 
ACAGT GATT CCT GCCTCACAGGT GAAGATTAAAGAGACAACGAATGTGAAT CATGCT TGCAGGT T 
TGAGGGCACAGTGTTTGCTAATGATGTGTTTTTATATTATACATTTTCCCACCATAAACTCTGTT 
T GCTTATTCCACAT TAAT TTACTT TTCTCTATACCAAATCACCCAT GGAATAGTTAT TGAACACC 
TGCTTTGT GAGGC T CAAAGAAT AAAGAGGAGG TAGGAT T T T T C AC T GAT T C T ATAAGCC CAGCAT 
TACCTGATACCAAAACCAGGCAAAGAAAACAGAAGAAGAGGAAGGAAAACTACAGGTCCATATCC 
CTCAT TAACACAGACACAAAAATTCTAAAT AAAAT TT TAACAAATTAAACTAAACAATATATT TA 
AAGATGATATATAACTACTCAGTGTGGT TT GTCCCACAAATGCAGAGT TGGTT TAATATT TAAAT 
ATCAACCAGTGTAATTCAGCACATTAATAAAGTAAAAAAGAAAACCATAAAAAAAAAAAAAAA 
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MAFVLILVLSFYELVSGQWQVTGPGKFVQALVGEDAVFSCSLFPETSAEMEVRFFRNQFHAVVH 
LYRDGEDWESKQMPQYRGRTEFVKDSIAGGRVSLRLKNITPSDIGLYGCWFSSQIYDEEATWELR 
VAAIjGSLPLISIVGYVDGGIQLLCLSSGWFPQPTAKWKGPQGQDLSSDSRANADGYSLYDVEISI 
IVQENAGSILCSIHLAEQSHEVESKVLIGETFFQPSPWRLASILLGLLCGALCGWMGMIIVFFK 
SKGKI QAELDWRRKHGQAELRDARKHAVEVT LDPE TAHPKLCVS DLKT VTHRKAPQEVPHSEKRF 
TRKS WASQG FQAGRHYWEVDVGQNVGWYVGVCRDDVDRGKNNVT LS PNNGYWVLRLTTEHL YFT 
FNPHFISLPPSTPPTRVGVFLDYEGGTISFFNTNDQSLIYTLLTCQFEGLLRPYIQHAMYDEEKG 
TPIFICPVSWG 

Signal peptide: 

amino acids 1-17 

Transmembrane domains: 

amino acids 131-150, 235-259 
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CCTTCAC AGGACTCTT CATT GCT G GT TGGC A ATGA T GT AT CGGCCAGATGTGGT GAGGGCTAGGAAAAGAG 
TTTGTTGGGAACCCTGGGTTATCGGCCTCGTCATCTTCATATCCCTGATTGTCCTGGCAGTGTGCATTGGA 
CT CACTGT TC ATTATGT GAG AT AT AATCAAAAGAAGACCT ACAAT T ACT ATAGC ACAT TGTC AT TT ACAAC 
TGACAAACTAT ATGCT GAGTTTGGC AGAGAGGCT TCT AAC AATT TT ACAGAAAT GAGCCAGAGACTTGAAT 
CAAT GGTGAAAAAT GC AT TT TATAAATCT CCATT AAGGGAAGAATT T GTCAAGTCTCAGGTT AT CAAGTTC 
AGTCAACAGAAGCATGGAGTGTTGGCTCATATGCTGTTGATTTGTAGATTTCACTCTACTGAGGATCCTGA 
AACT GT AGAT AAAATT GTT C AACTTGTT T T AC ATGAAAAGCT GCAAGATGCTGT AGGACCCCCT AAAGT AG 
p AT CCT C ACTCAGT T AAAATT AAAAAAATC AAC AAGACAGAAACAGACAGCTATCTAAACCATTGCT GCGGA 

S AC ACGAAGAAGT AAAACTCT AGGT CAGAGT CT CAGGATCGT T GGTGGGAC AGAAGT AG AAGAGGGT GAATG 

m GCCCT GGC AGGCT AGC CTGC AGTGGGATGGGAGT CATCGCTGTG GAGC AACCTT AAT TAATGCC AC ATGGC 

I : | TTGTGAGTGCTGCTCACTGTTTTACAACATATAAGAACCCTGCCAGATGGACTGCTTCCTTTGGAGTAACA 
Iff AT AAAACCTT CGAAAATGAAACGGGGTCTCCGGAGAAT AATT GT CCATGAAAAAT ACAAACACCCAT C ACA 

01 TGACTATGATATTTCTCTTGCAGAGCTTTCTAGCCCTGTTCCCTACACAAATGCAGTACATAGAGTTTGTC 
\hi TCCCTGAT GC ATCCT AT GAGTT TC AACCAGGT GAT GTGAT GTT T GTGACAGGAT TTGGAGCACT GAAAAAT 

* GATGGTTACAGTCAAAATCATCTTCGACAAGC AC AGGT GACTCT CATAGACGCT ACAACT T GCAAT GAAC C 

PI 

TC AAGCTT AC AATGAC GCCATAACT C CTAGAATGT T ATGT GCTGGC TCCT TAGAAGGAAAAACAGATGC AT 
%l GC CAGGGT GACTCT GGAGGACCACT GGT TAGT TC AGAT GCTAGAGATATCTGGT ACCT TGCTGGAAT AGT G 

AGCT GGGGAGATGAAT GTGC GAAACC CAACAAGC CTGGTGTTT ATACT AGAGTT ACGGCCTTGC GGGACTG 

% Is? 

GATT ACTT CAAAAACT GGT ATC TAA GAGACAAAAGCCT CATGGAAC AGAT AACAT TTTTTTTTGTTTTTTG 
fH GGT GTGGAGGCC AT TT TT AG AGAT AC AG AATT GGAGAAGACTTGCAAAACAGCT AGAT TT GACTGATC TC A 

ATAAACTGTTTGCTTGATGCATGTATTTTCTTCCCAGCTCTGTTCCGCACGTAAGCATCCTGCTTCTGCCA 
GAT C AACT CT GTCAT CTGTGAGCAAT AGT TGAAACTTT AT GT AC AT AGAGAAAT AG ATAATACAAT AT T AC 
AT T ACAGCCT GT AT T CATTT GTTCTCTAGAAGTT TT GT C AGAAT TTTGACTTGTTG AC AT AAATTT GT AAT 
GC AT AT AT ACAATT TGAAGC ACTCCT TT T CTT C AGTTCCT C AGC TCCTCTCAT TTC AGCAAAT ATCCATT T 
TC AAGGT GCAGAAC AAGGAGTGAAAGAAAAT ATAAGAAGAAAAAAATCCCCT AC AT T T T ATT GGCACAGAA 
AAGT AT T AGGTGTT T T TCTT AGTGGAAT ATT AGAAATGAT CAT ATT CATTAT GAAAGGTC AAGC AAAGACA 
GCAGAATACCAATCACTTCATCATTTAGGAAGTATGGGAACTAAGTTAAGGAAGTCCAGAAAGAAGCCAAG 
AT AT AT CC TT AT TTTC ATTTCC AAAC AACT AC T ATGAT AAATGT GAAGAAGATTCT GT TTTTTT GT GAC CT 
AT AAT AAT T ATAC AAACT TC AT GCAATGT ACT TGT TCT AAGC AAAT T AAAGCAAAT AT TT ATTT AACATTG 
TTACTGAGGATGTCAACATATAACAATAAAATATAAATCACCCA 
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MMYRPDVVRARKRVCWEPWVIGLVIFISLIVLAVCIGLTVHYVRYNQKKTYNYYSTLSFTTDKLY 
AEFGREASNNFTEMSQRLESMVKNA 

PETVDKIVQLVLHEKLQDAVGPPKVDPHSVKIKKINKTETDSYLNHCCGTRRSKTLGQSLRIVGG 
TEVEEGEWPWQAS LQWDGSHRCGATL INAT WLVSAAHC FTT YKNPARWTAS FGVT I KPSKMKRGL 
RRIIVHEKYKHPSHDYDISIAELSSPVPYTNAVHRVCLPDASYEFQPGDVMFVTGFGALKNDGYS 
QNHLRQAQVT L I DATT CNE PQAYNDAI T PRMLCAGSLE GKTDACQGDS GGPLVS S DARD I WYLAG 
IVSWGDECAKPNKPGVYTRVTALRDWITSKTGI 

Transmembrane domain: 

amino acids 21-40 (type II) 
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AGAGAAAGAAGCGTCTCCAGCTGAAGCCAATGCAGCCCTCCGGCTCT.CCGCGAAGAAGTTCCCTG 
CCCCGATGAGCCCCCGCCGTGCGTCCCCGACTATCCCCAGGCGGGCGTGGGGCACCGGGCCCAGC 
GCCGACGATCGCTGCCGTTTTGCCCTTGGGAGTAGGATGTGGTGAAAGGATGGGGCTTCTCCCTT 
ACGGGGCTCACAATGGCCAGAGAAGATTCCGTGAAGTGTCTGCGCTGCCTGCTCTACGCCCTCAA 
TCTGCTCTTTTGGTTAATGTCCATCAGTGTGTTGGCAGTTTCTGCTTGGATGAGGGACTACCTAA 
ATAAT GT T CT CACT TTAACTGCAGAAACGAGGGTAGAGGAAGCAGTCAT T T TGAC T TAC T T TCCT 
GTGGTTCATCCGGTCATGATTGCTGTTTGCTGTTTCCTTATCATTGTGGGGATGTTAGGATATTG 
TGGAACGGTGAAAAGAAATCTGTTGCTTCTTGCATGGTACTTTGGAAGTTTGCTTGTCATTTTCT 
GTGTAGAACTGGCTTGTGGCGTTTGGACATATGAACAGGAACTTATGGTTCCAGTACAATGGTCA 
GATATGGTCACT T.T GAAAGCCAGGAT GACAAAT TATGGATTACCTAGATATCGGTGGCT TAC T CA 
TGCTTGGAATTTTTTTCAGAGAGAGTTTAAGTGCTGTGGAGTAGTATATTTCACTGACTGGTTGG 
AAATGACAGAGATGGAC TGGCCCCCAGAT T CCTGC TGT GTTAGAGAAT TCCCAGGAT GT TCCAAA 
l *{ CAGGCCCACCAGGAAGATC TCAGTGACCTT TATCAAGAGGGT TGT GGGAAGAAAATGTAT TCC TT 

*] T TT GAGAGGAACCAAACAAC TGCAGGTGCT GAGGT TTCT GGGAAT CTCCAT TGGGGT GACACAAA 

l| TCCTGGCCATGATTCTCACCATTACTCTGCTCTGGGCTCTGTATTATGATAGAAGGGAGCCTGGG 
ACAGACCAAATGATGTCCTTGAAGAATGACAACTCTCAGCACCTGTCATGTCCCTCAGTAGAACT 
GT T GAAACCAAGCC TGTCAAGAAT CT T T GAACACACATCCAT GGCAAACAGCT TTAATACACACT 
T TGAGATGGAGGAGTTATAAAAAGAAAT GT CACAGAAGAAAACCACAAAC T TGT T T TAT TGGAC T 
T GT GAATT T T TGAGTACATACTAT GT GT T T CAGAAATATGTAGAAATAAAAATGT TGCCATAAAA 
TAACACCTAAGCATATAC TATTC TATGCT T TAAAATGAGGATGGAAAAGT T TCATGT CATAAGT C 
ACCACC TGGACAATAAT TGATGCCCT TAAAATGCTGAAGACAGAT GTCATACCCAC TGT GTAGCC 
T GT GTATGACTT T TAC TGAACACAGT TATGT TT TGAGGCAGCATGGT T TGATTAGCAT T T CCGCA 
T CCATGCAAACGAGTCACATATGGT GGGACT GGAGCCATAGTAAAGGT TGAT T TAC TTCTACCAA 
CTAGTATATAAAGTAC TAAT TAAATGCTAACATAGGAAGTTAGAAAATAC TAATAACTT T TAT TA 



y 



Rl CT CAGCGAT CTAT T CT TCTGAT GCTAAATAAATTATATATCAGAAAACT T TCAATAT TGGTGAC T 

Q ACCTAAATGT GAT T TT TGC TGGT TACTAAAATAT TCT TACCACTTAAAAGAGCAAGCTAACACAT 

f|J TGTCTTAAGCTGATCAGGGATTTTTTGTATATAAGTCTGTGTTAAATCTGTATAATTCAGTCGAT 
TT CAGT TCT GATAATGT TAAGAATAACCAT TATGAAAAGGAAAAT T T GT CCTGTATAGCATCATT 
ATTTTTAGCCTTTCCTGTTAATAAAGCTTTACTATTCTGTCCTGGGCTTATATTACACATATAAC 
TGT TAT T T AAAT AC T T AAC C AC TAAT T T T GAAAAT TAC C AG TGT G AT AC AT AGGAAT CAT TAT T C 
AGAATGTAGTCTGGTCTTTAGGAAGTATTAATAAGAAAATT TGCACATAACTTAGT TGATTCAGA 
AAGGAC T TGTATGC TGT T T T TCTCC CAAAT GAAGACTCT TT T TGACACTAAACAC T T T T TAAAAA 
GC T TAT C T T T GCC T TC TCCAAACAAGAAGCAATAGT C T CCAAGT CAATATAAAT TC TACAGAAAA 
TAGT GT TCT T TT TC TCCAGAAAAATGC T TGT GAGAATCATTAAAACAT GT GACAAT T TAGAGAT T 
CTTTGTTT TATT TCAC TGAT TAATATACTGT GGCAAAT TACACAGATTAT TAAATT T T TT TACAA 
GAGT ATAGTATAT T TAT T TGAAAT GGGAAAAGTGCAT T T TACTGTAT T T T GTGTAT T T TGT TTAT 
T T CTCAGAAT AT GGAAAGAAAAT TAAAATGT GTCAATAAATAT T T TCTAGAGAGTAA 
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MAREDSVKCLRCLLYMjNLLFWI^ 

VMIAVCCFLI I VGMLGYCGTVKRNLLLLAWYFGSLLVI FCVELACGVWTYEQELMVPVQWSDMVT 
LKARMTNYGLPRYRWLTHAWNFFQREFKCCGVW^ 

EDLSDLYQEGCGKKMYSFLRGTKQLQVLRFLGISIGVTQILAMILTITLLWALYYDRREPGTDQM 
MSLKNDNSQHLSCPSVELLKPSLSRIFEHTSMANSFNTHFEMEEL 



at 



Signal peptide: 

amino acids 1-33 

Transmembrane domains: 

amino acids 12-35, 57-86, 94-114, 226-248 
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CCAAGGCCAGAGCTGTGGACACCTTATCCCACTCATCCTCATCCTCTTCCTCTGATAAAGCCCCTACCAGTGCT 
GATAAAGTCTTTCTCGTGAGAGCCTAGAGGCCTTAAAAAAAAAAGTGCTTGAAAGAGAAGGGGACAAAGGAACA 
CCAGTAT T AAGAGGAT T TT CCAGT GT T T C TG GCAGT T GG T CC AGAAGGATGCCT C CAT T CCT GCT T C TCAC CT G 
CCTCTTCATCACAGGCACCTCCGTGTCACCCGTGGCCCTAGATCCTTGTTCTGCTTACATCAGCCTGAATGAGC 
CCT GGAGGAAC ACT GACC ACCAGTT GGAT GAGT CT C AAGGTCC T CCT CT AT GT GAC AACC AT GT GAAT GGGGAG 
T GGT ACC ACTT CACGGGC AT GGCGGGAGATGCCAT GCCT ACCTTCTGCATACCAGAAAACCACT GT GGAACCCA 
CGCACCT GTCT GGCTCAAT GGCAGCCACCCCCTAGAAGGCGAC GGCATTGT GCAAC GCCAGGCT TGTGCCAGCT 
T CAAT GGGAACT GCT GTC T CT GG AACAC C AC GGT GGAAGTC AAGGC T T GCC C T GGAGGCT ACTAT GT GT AT C GT 
CT GAC C AAGC CC AGC GTC T GCT T CC AC GTCT ACT GT GGT C ATT T TTAT GAC AT C T GC GAC GAGGAC T GCCAT GG 
CAGCT GC TC AGATAC C AGC GAGT GCAC AT GC GCT C C AGG AACT GT GCT AGGCC C T GACAGGC AGACAT GCTT T G 
f j AT GAAAATGAAT GT GAGC AAAAC AAC GGT GG CT GCAGT GAGAT C T GT GT GAAC C TC AAAAACT CCT ACC GCT GT 

s*i GAGT GT G GGGT T GGCC GT GT GCT AAGAAGT G AT GGC AAG ACT T GT GAAGAC GT T GAAGGAT GC C ACAAT AAC AA 

*F 1 

fp T GGT GGCTGCAGC CAC T C TT GCC TT GGAT CT GAGAAAGGCT AC C AGT GTGAAT GT C CCC GGGGCCT GGT GCT GT 

"i;r ? 

k f\ CT GAGGAT AAC CACAC T T GCC AAGTC C CT GT GT T GT GC AAAT C AAAT GC CAT T GAAGT GAACAT CCCC AGGGAG 

H CT GGTTGGT GGCCT GGAGCTCTT CCTGACCAACACCTCCTGCC GAGGAGT GTCCAACGGCACCCAT GTCAACAT 

CCTCTTCTCTCTCAAGACAT GT GGTACAGTGGTCGATGT GGT GAAT GACAAGATT GTGGCCAGCAACCTCGT GA 
C AGGTCT ACCCAAGC AGACCC C GGGGAGC AGC GGGGACT T CAT CAT C C GAAC C AGC AAGCT GCT GATCCC GGT G 
|J ACCT GC GAGT T T CCAC GCC T GTAC AC C AT TT CTGAAGGAT AC GTTCC C AACC T T CG AAACT CCC C ACT GGAAAT 

III CAT GAGC C GAAAT CAT GGGAT CT T CC C ATTC ACT C T GGAGATC T TC AAGGAC AAT GAGT T T GAAGAGCCT T ACC 

%*J GGGAAGCTCTGCCCACCCTCAAGCTTCGTGACTCCCTCTACTTTGGCATTGAGCCCGT GGT GCACGT GAGC GGC 

III TTGGAAAGCTTGGTGGAGAGCTGCTTTGCCACCCCCACCTCCAAGATCGACGAGGTCCTGAAATACTACCTCAT 
C C GGGAT GGCT GT GT T TC AGAT GACT C GGT AAAGC AGT AC ACAT CCC GGGAT CAC C T AGC AAAGC ACTT CC AGG 
TCCCTGTCTTCAAGTTTGTGGGCAAAGACCACAAGGAAGTGTTTCTGCACTGCCGGGTTCTTGTCTGTGGAGTG 
TTGGACGAGCGTTCCCGCTGTGCCCAGGGTTGCCACCGGCGAATGCGTCGTGGGGCAGGAGGAGAGGACTCAGC 
C GGTCT ACAGGGCCAGAC GCT AAC AGG C GGC CC GATCCGCATC GACT GGGAGGAC TAG T TC GT AGC CAT ACCT C 
GAGTCC CT GC AT T GGAC GGCT CT GCT C T T TGGAGC TT CT CCCC C CAC C GCC CT C T AAGAAC AT C T GCC AAC AGC 
TGGGTTCAGACTTCACACTGTGAGTTCAGACTCCCAGCACCAACTCACTCTGATTCTGGTCCATTCAGTGGGCA 
CAGGTCACAGC ACT GCTGAACAATGT GGCCTGGGTGGGGTTTC ATCT TTCTAGGGTTGAAAACT AAACT GTCCA 
CCCAGAAAGACACTCACCCCATTTCCCTCATTTCTTTCCTACACTTAAATACCTCGTGTATGGTGCAATCAGAC 
C AC AAAAT CAGAAGC T GGGT AT AAT AT T TC AAGTT ACAAAC C CT AGAAAAAT T AAAC AGT TACT GAAATT AT GA 
CTTAAATACCCAATGACTCCTTAAATATGTAAATTATAGTTATACCTTGAAATTTCAATTCAAATGCAGACTAA 
T TAT AGG GAAT T T GGAAGT GTAT CAAT AAAACAGT ATAT AATTTT 



O 
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MPPFLLLTCLFITGTSVSPVMjDPCSAYISLNEPWRNTDHQLDESQGPPLCDNHVNGEWYHFTGMAGDAMP 
TFCI PENHCGTHAPVWLNGSHPLEGDGIVQRQACASFNGNCCLWNTTVEVKACPGGYYVYRLTKPSVCFHV 
YCGHFYDICDEDCHGSCSDTSECTCAPGTVLGPDRQTCFDENECEQNNGGCSEICVNLKNSYRCECGVGRV 
LRSDGKTCEDVEGCHNNNGGCSHSCLGSEKGYQCECPRGLVLSEDNHTCQVPVLCKSNAIEVNIPRELVGG 
LELFLTNTSCRGVSNGTHVNILFSLKTCGTWDWNDKIVASNLVTGLPKQTPGSSGDFIIRTSKLLIPVT 
CEFPRLYTISEGYVPNLRNSPLEIMSRNHGIFPFTLEIFKDNEFEEPYREALPTLKLRDSLYFGIEPWHV 
SGLESLVESCFATPTSKIDEVLKYYLIRDGCVSDDSVKQYTSRDHLAKHFQVPVFKFVGKDHKEVFLHCRV 
f»i. LVCGVLDERSRCAQGCHRRMRRGAGGEDSAGLQGQTLTGGPIRIDWED 

W Important features of the protein: 

■rfl 

* r • Signal peptide: 

amino acids 1-16 

lis 

m 

. f\ N-glycosylation sites . 

™ amino acids 89-93, 116-120, 259-263, 291-295, 299-303 

£*% 

111 Tyrosine kinase phosphorylation sites. 

O amino acids 411-418, 443-451 

rii 

l?l N-myristoylation sites. 

m 

*** amino acids 226-232, 233-239, 240-246, 252-258, 296-302, 300-306, 

522-528, 531-537 

Aspartic acid and asparagine hydroxylation site. 

amino acids 197-209 

ZP domain proteins. 

amino acids 431-457 

Calcium-binding EGF-like proteins. 

amino acids 191-212, 232-253 
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13 



GAGAGAGGCAGCAGCTTGCTCAGCGGACAAGGATGCTGGGCGTGAGGGACCAAGGCCTGCCCTGCACTCGG 
GCCTCCTCCAGCCAGTGCTGACCAGGGACTTCTGACCTGCTGGCCAGCCAGGACCTGTGTGGGGAGGCCCT 
CC TGCT GCCT TGGGGT GACAAT CT CAGCTCCAGGCT AC AGGGAGACCGGGAGGATC AC AGAGCC AGC ATG T 
T ACAGGATCCTGAC AGT GAT CAACCT CT GAAC AGCCT C GATGTC AAAC CCCT GCGCAAACCCCGTATCCC C 
ATGGAGAC CT TC AGAAAGGT GGGGAT CCCCAT CATC AT AGCACT ACT GAG CCTGGCGAGT ATCATC ATTGT 
GGTTGTCCTCATCAAGGTGATTCTGGATAAATACTACTTCCTCTGCGGGCAGCCTCTCCACTTCATCCCGA 
GGAAGC AGCT GTGT GACGGAGAGCTGGACTGTCC CT TGGGGGAGGACG AGGAGC ACT GTGTC AAGAGCTT C 
CC CGAAGGGC CTGCAGTGGC AGTCCGCCT C TCCAAGGACC GATC CACACT GCAGGTGCTGGACT CGGCCAC 
AGGGAACTGGTTCTCT GCCT GT TTCGAC AACTTC AC AGAAGC TC TCGCT G AG AC AGCC TGTAGGCAGATGG 
GCTACAGC AGAGCT GTGGAGAT TGGC CCAGACCAGGATCTGG AT GT TGTT GAAATC AC AGAAAACAGCCAG 
GAGCTTCGCATGCGGAACTCAAGTGGGCCCTGTCTCTCAGGCTCCCTGGTCTCCCTGCACTGTCTTGCCTG 



*?* T GGGAAGAGC CTGAAGACCCCCCGT GTGGTGGGT GGGGAGGAGGCC TCTGTGGATTCT TGGCCTTGGCAGG 

in TC AGCATCCAGT ACGACAAACAGCAC GT CTGT GGAGGGAGC ATCCT GGAC CCCC ACTGGGTCCTCACGGC A 

« 

*J| GCCC ACTGCTTC AGGAAACAT ACCGATGTGTTCAAC TG GAAGGT GCGGGC AGGC TCAG AC AAACTGGGCAG 

B CTTCCCAT CCCTGGCTGTGGCCAAGATCAT CATC AT TGAATT CAACCCCATGT ACCCC AAAGACAAT G ACA 

Q T CGCCCT CAT GAAGCTGC AGTT CCCACT CACT TT CTCAGGCACAGT CAGGCCCATCTGTCTGC CCT TCTT T 

a s« 

GATG AGGAGC TCACTCCAGCCACCCCACTCTGGATC AT TGGATGGGGCTT TACGAAGCAGAAT G GAGGGAA 

ft 

GATGTCTGACATACTGCTGCAGGCGTCAGTCCAGGTCATTGACAGCACACGGTGCAATGCAGACGATGCGT 

if £ 

:-f ACCAGGGGGAAGTC ACCGAGAAGATGAT GTGT GCAGGC AT CCCGGAAGGGGGTGTGGACACCT G CC AGGGT 

GACAGTGGTGGGCCCCTGATGTACCAATCTGACCAGTGGCATGTGGTGGGCATCGTTAGCTGGGGCTATGG 
CT GCGGGGGC CCGAGCACCCCAGGAGTAT ACACC AAGGTC T C AG CCT AT C TCAACT GG ATCT AC AATGTC T 
GGAAGGCTGAGCTG TAA TGCTGCTGCCCCTTTGCAGTGCTGGGAGCCGCTTCCTTCCTGCCCTGCCCACCT 
GGGG ATCCCCC AAAGT CAGACACAGAGC AAGAGT CCCCTT GGGT ACACCC CTCT GCCC ACAGCC TCAGC AT 
T T CT TGGAGC AGC AAAGGGCCTCAAT TCCTGT AAGAGACCCTCGCAGCCC AG AGGCGC CCAGAGGAAGTC A 
GC AGCCCTAGCTCGGC CACACT TGGT GCTCCCAGCATCCCAGGGAGAGACACAGCCCACTGAAC AAGGTC T 
CAGGGGTATTGCTAAGCCAAGAAGGAACTTTCCCACACTACTGAATGGAAGCAGGCTGTCTTGTAAAAGCC 
CAGAT C ACTGTGGGCT GGAGAGGAGAAGGAAAGGGTCT GCGCC AGCCCT GTC CGTCTT CACCCATCCC CAA 
GCCTACTAGAGCAAGAAACCAGTTGTAATATAAAATGCACTGCCCTACTGTTGGTATGACTACCGTTACCT 
ACTGT TGT C ATT GT T ATT AC AGCT AT GGCCACT AT TAT TAAAGAGC TGTGT AAC ATCT CT GG CAAAAAAAA 
AAAA 
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MLQDPDSDQPLNS LDVKPLRKPRI PMET FRKVG I P 1 1 1 ALLS LAS 1 1 1 WVXI KVILDKYYFLCG 
QPLHFIPRKQLCDGELDCPLGEDEEHCVKSFPEGPAVAVRLSKDRSTLQVLDSATGNWFSACFDN 
FTEALAETACRQMGYSRAVEIGPDQDLDVVEITENSQELRMRNSSGPCLSGSLVSLHCLACGKSL 
KTPRWGGEEASVDSWPWQVSIQYDKQHVCGGSILDPHWLTAAHCFRKHTDVFNWKVRAGSDKL 
GSFPSLAVAKIIIIEFNPMYPKDNDIALMKLQFPLTFSGTVRPICLPFFDEELTPATPLWIIGWG 
FTKQNGGKMS DI LLQASVQVIDS TRCNADDAYQGEVTEKMMCAG I PEGGVDTCQGDSGGPLMYQS 
DQWHWG I VS WGYGCGGPS TPGVYTKVSAYLNWI YNVWKAEL 

Transmembrane domain: 

amino acids 32-53 (typell) 
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GGCTGGACTGGAACTCCTGGTCCCAAGTGATCCACCCGCCTCAGCCTCCCAAGGTGCTGTGATTA 
TAGGTGTAAGCCACCGTGTCTGGCCTCTGAACAACTTTTTCAGCAACTAAAAAAGCCACAGGAGT 
TGAACTGCTAGGATTCTGACTATGCTGTGGTGGCTAGTGCTCCTACTCCTACCTACATTAAAATC 
TGTTTTTTGTTCTCTTGTAACTAGCCTTTACCTTCCTAACACAGAGGATCTGTCACTGTGGCTCT 
GGCCCAAACCTGACCTTCACTCTGGAACGAGAACAGAGGTTTCTACCCACACCGTCCCCTCGAAG 
CCGGGGACAGCCTCACCTTGCTGGCCTCTCGCTGGAGCAGTGCCCTCACCAACTGTCTCACGTCT 
y a GGAGGCACTGACTCGGGCAGTGCAGGTAGCTGAGCCTCTTGGTAGCTGCGGCTTTCAAGGTGGGC 
O CTTGCCCTGGCCGTAGAAGGGATTGACAAGCCCGAAGATTTCATAGGCGATGGCTCCCACTGCCC 
M AGGCATCAGCCTTGCTGTAGTCT^ATCACTGCCCTGGGGCCAGGACGGGCCGTGGACACCTGCTCA 
; GAAGCAGTGGGTGAGACATCACGCTGCCCGCCCATCTAACCTTTTCATGTCCTGCACATCACCTG 
III ATCCATGGGCTAATCTGAACTCTGTCCCAAGGAACCCAGAGCTTGAGTGAGCTGTGGCTCAGACC 
||| CAGAAGGGGTCTGCTTAGACCACCTGGT TTATGT GACAGGACTT GCATT CT CCTGGAACATGAGG 

ill GAACGCCGGAGGAAA.GCAAAGTGGCAGGGAAGGAACTTGTGCCAAATTATGGGTCAGAAAAGATG 

* GAGGTGTTGGGTTATCACAAGGCATCGAGTCTCCTGCATTCAGTGGACATGTGGGGGAAGGGCTG 

H 

CCGATGGCGCATGACACACTCGGGACTCACCTCTGGGGCCATCAGACAGCCGTTTCCGCCCCGAT 
P| CCACGTACCAGC TGCTGAAGGGCAACT GCAGGCCGAT GC TC TCAT C AGCC AGGCAGCAGCCAAAA 

HI TCTGCGATCACCAGCCAGGGGCAGCCGTCTGGGAAGGAGCAAGCAAAGTGACCATTTCTCCTCCC 
13 CT CCTT CCCT CTGAGAGGCCC TCCTAT GTCCC TACTAAAGCCACCAGCAAGACATAGC T GACAGG 

^ GGCTAATGGCTCAGTGTTGGCCCAGGAGGTCAGCAAGGCCTGAGAGCTGATCAGAAGGGCCTGCT 
GTGCGAACACGGAAATGCCTCCAGTAAGCACAGGCTGCAAAATCCCCAGGCAAAGGACTGTGTGG 
CT CAAT T TAAATCATGT TCTAGTAAT T GGAGC TGTCCCCAAGACCAAAGGAGC TAGAGCT TGGTT 
GAAAT GAT C TCCAAGGGCCC TTATACCCCAGGAGACT T T GAT TTGAAT T T GAAACCCCAAATCCA 
AACCTAAGAACCAGGTGCATTAAGAATCAGTTATTGCCGGGTGTGGTGGCCTGTAATGCCAACAT 
TTTGGGAGGCCGAGGCGGGTAGATCACCTGAGGTCAGGAGTTCAAGACCAGCCTGGCCAACATGG 
TGAAACCCCTGTCTCTACTAAAAATACAAAAAAACTAGCCAGGCATGGTGGTGTGTGCCTGTATC 
CCAGCTACTCGGGAGGC TGAGACAGGAGAAT TAC T TGAACC TGGGAGGT GAAGGAGGCTGAGACA 
GGAGAAT CACTT CAGCC TGAGCAACACAGCGAGAC TC T GTC TCAGAAAAAATAAAAAAAGAAT T A 
TGGTTATTTGTAA 
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FIGURE lid 

MLWWLVLLLLPTLKSVFCSLVTSLYLPNTEDLSLWLWPKPDLHSGTRTEVSTHTVPSKPGTASPC 
WPLAGAVPSPTVSRLEALTRAVQVAEPLGSCGFQGGPCPGRRRD 

Signal peptide: 
amino acids 1-15 
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FIGURE us 

CAGCAGTGGTCTCTCAGTCCTCTCAAAGCAAGGAAAGAGTACTGTGTGCTGAGAGACCATGGCAA 
AGAAT C C T C C AGAGAAT T G T GAAGAC T G T C ACAT T C TAAAT GCAGAAG CT T T TAAAT C CAAGAAA 
ATATGTAAATCACTTAAGATTTGTGGACTGGTGTTTGGTATCCTGGCCCTAACTCTAATTGTCCT 
GTTT T GGGGGAGCAAGCAC TTC T GGCCGGAGGTACCCAAAAAAGCCTATGACAT GGAGCACACTT 
T C T ACAGCAATGGAGAGAAGAAGAAGAT TTACAT GGAAAT T GAT CCTGT GACCAGAAC TGAAATA 
T TCAGAAGCGGAAAT GGCAC TGAT GAAACAT TGGAAGTGCACGACTTTAAAAACGGATACAC T GG 
CATCTACTTCGTGGGTCTTCAAAAATGTTTTATCAAAACTCAGATTAAAGTGATTCCTGAATTTT 
CTGAACCAGAAGAGGAAATAGATGAGAATGAAGAAATTACCACAACT TTCT TTGAACAGTCAGTG 
AT TT GGGT CCCAGCAGAAAAGCC TAT T GAAAACCGAGATT T TCT TAAAAAT TCCAAAAT T CTGGA 
GAT T TGTGATAACGTGACCATGTAT T GGAT CAATCCCACT C TAATAT CAGT T TCTGAGT TACAAG 
ACTT TGAGGAGGAGGGAGAAGAT CT T CACTT TCCTGCCAACGAAAAAAAAGGGAT TGAACAAAAT 
GAACAGTGGGTGGTCCCTCAAGTGAAAGTAGAGAAGACCCGTCACGCCAGACAAGCAAGTGAGGA 
AGAACTTCCAATAAATGACTATACTGAAAAT GGAATAGAAT TTGATCCCATGCTGGATGAGAGAG 
GTTATTGTTGTATTTACTGCCGTCGAGGCAACCGCTATTGCCGCCGCGTCTGTGAACCTTTACTA 
GGCTACTACCCATATCCATACTGCTACCAAGGAGGACGAGTCATCTGTCGTGTCATCATGCCTTG 
TAACTGGTGGGTGGCCCGCATGCTGGGGAGGGTCTAATAGGAGGTTTGAGCTCAAATGCTTAAAC 
TGCTGGCAACATATAATAAATGCATGCTATTCAATGAATTTCTGCCTATGAGGCATCTGGCCCCT 
GGTAGCCAGCTCTCCAGAATTACTTGTAGGTAATTCCTCTCTTCATGTTCTAATAAACTTCTACA 
T TAT C AC C AAAAAAAAAAAAAAAAAAA 
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FIGURE 116 

MAKNPPENCEDCHILNAEAFKSKKICKSLKICGLVFGIL^TLIVLFWGSKHFWPEVPKKAYDME 
HTFYSNGEKKKIYMEIDPVTRTEIFRSGNGTDETLEVHDFKNGYTGIYFVGLQKCFIKTQIKVIP 
E FSE PEEE I DENEE I TT T FFEQS VI WVPAEKP I ENRDFLKNS KI LE I CDNVTMYW INP T L I S VSE 
LQDFEEEGEDLHFPANEKKGIEQNEQWWPQVKVEKTRHARQASEEELPINDYTENGIEFDPMLD 
ERG YCC I YCRRGNRYCRRVCEPLLGYYP YPYC YQGGRVI CRVIMPCNWWVARMLGRV 

Important features of the protein: 
Signal peptide: 

amino acids 1-40 

Transmembrane domain: 

amino acids 25-47 (type II) 

N-glycosylation sites. 

amino acids 94-97, 180-183 

Glycosaminoglycan attachment sites. 

amino acids 92-95, 70-73, 85-88, 133-136, 148-151, 192-195, 239- 
242 

N-myristoylation sites. 

amino acids 33-38, 95-100, 116-121, 215-220, 272-277 

Microbodies C-terminal targeting signal. 

amino acids 315-317 

Cytochrome c family heme-binding site signature. 

amino acids 9-14 
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FIGURE 117 

GAGCT CCCCT CAGGAGCGCGTT AGCTTCAC ACCT TC GGCAGCAGGAGG GCGGCAGCT T CT CGCAGGCGGC A 
GGGC GG GCGGCC AGGATC ATGTCCACCACC ACAT GCCAAGTG GTGGCGTTCCTCCT GT CC AT CCTGGGGC T 
GGCCGGCT GC AT CGCGGCCACCGGGATGGACAT GTG GAGC AC CCAGGACCTGT ACGAC AACCCC GTCACC T 
CCGT GT T CCAGT ACGAAGGGCT CT GG AGGAGCTGCGTGAGGC AGAGTT CAGGCT TC ACCGAATGCAGGCC C 
T ATT TC AC CATCCTGGGACT TCCAGC CAT GCTGC AGGC AGTGCGAGCCCTGATG AT CGT AGGCATCGTCC T 
GGGTGCCATT GGCCT C CT GGT ATCCATC TTTGCCCT GAAATGC ATCCGCAT T GGCAGCAT GGAGGACTCT G 
CC AAAGCC AAC ATGAC ACT GACCTCCGGGATC AT GT T CAT TGTCTCAGGTCTTT GTGC AATTGCTGGAGT G 
T C TGTGTTTGCC AACATGCT GGTG ACT AACTTCT GGATGT CCAC AGCT AACATGT ACACC GGCATGGGTGG 
GATGGTGCAGACTGTTCAGACCAGGTACACATTTGGTGCGGCTCTGTTCGTGGGCTGGGTCGCTGGAGGCC 
TCACACTAATTGGGGGTGTGATGATGTGCATCGCCTGCCGGGGCCTGGCACCAGAAGAAACCAACTACAAA 
GCCGT TT CTT ATC ATGCCTC AGGCCACAGT GT TGCC TACAAGCCTGGAGGCT TC AAGGCC AGCACT GGCT T 
TGGGTCCAACACCAAAAACAAGAAGATATACGATGGAGGTGCCCGCACAGAGGACGAGGTACAATCTTATC 
CT TCCAAGCACGACT AT GTG TAATGC TC TAAGAC CT CT CAGC ACGGGCGGAAGAAACT CCCGGAGAGCTC A 
CC CAAAAAAC AAGGAGATCCCATCT AGATTTCTTCTT GCT TTTGACTCAC AGCT GGAAGT T AGAAAAGCCT 
CG AT TTCATCTTTGGAGAGGCCAAAT GGT CTT AGCCTC AGTCTCT GTCTCTAAAT AT TCCACCATAAAAC A 
GCTGAGTT ATTT ATGAAT T AGAGGCT AT AGCTC ACATT T T CAAT CCTCTATT TCTTTT TT TAAAT AT AACT 
TTCT ACTCTGATGAGAGAATGT GGTTTT AAT CT C TCTCTC ACAT TT TGAT GATTTAGAC AGACT CCCCCTC 
TTCCT CCTAGTC AATAAACCCATTGATGATCT AT TTCCCAGCTT AT CCCC AAGAAAACTT T T GAAAGGAAA 
GAGT AGACCC AAAGAT GTT ATTT TCT GCTGTT TGAATTTT GT CT CCCC AC CCCC AACT TGGCT AGT AATAA 
ACACTT ACTGAAGAAGAAGCAAT AAGAGAAAGAT ATTT GT AATCTCTCC AGC CC ATGATCTCGGT T TT CTT 
ACACTGTGAT CTT AAAAGTT ACCAAACCAAAGTC AT TT TC AGT T TGAGGC AACC AAACCTTT CT AC T GCT G 
TTGACATCTTCTTATTACAGCAACACCATTCTAGGAGTTTCCTGAGCTCTCCACTGGAGTCCTCTTTCTGT 
CGCGGGTCAG AAATTGTCCC TAGATGAATGAGAAAAT T ATTT TT TT TAAT TT AAGT CCTAAAT AT AGT TAA 
AATAAATAAT GTTTTAGT AAAATGAT AC ACTATC TCTGTGAAAT AG CCT C ACCCCT AC ATGT GGAT AGAAG 
GAAAT GAAAAAATAATTGCT TTGACATT GTCTAT ATGGT ACTTTGT AAAGTCAT GCT T AAGT AC AAAT TC C 
AT GAAAAGCTCAC ACCTGT AAT CCT AGC ACT T TGGGAGGCTGAGGAGGAAGGAT C ACT T GAGCCC AGAAGT 
TCGAGACTAGCCT GGGCAAC AT GGAGAAGCCC TG TCTCT ACAAAAT AC AGAGAGAAAAAATC AGCC AGTC A 
TGGTGGCATACACCTGTAGT CCCAGCAT TCCGGGAGGCTG AGGT GGGAGGATCACTT GAGCC C AGGGAGGT 
TGGGGCTGCAGTGAGCCATGATCACACCACTGCACTCCAGCCAGGTGACATAGCGAGATCCTGTCTAAAAA 
AATAAAAAATAAATAATGGAACACAGCAAGTCCTAGGAAGTAGGTTAAAACTAATTCTTTAA 
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FIGURE 118 



MSTTTCQWAFLLSILGLAGCIAATGMDMWSTQDLYDNPVTSVFQYEGLWRSCVRQSSGFTECRP 
YFTILGLPAMLQAVRALMIVGIVLGAIGLLVSI^ 

CAI AGVSVFANMLVTNFWMS TANMYTGMGGMV QTVQTRYTFGAALFVGWVAGGLTL I GGVMMC I A 
CRGLAPEETNYKAVSYHASGHSVAYKPGGFKASTGFGSNTKNKKIYDGGARTEDEVQSYPSKHDY 

V 

Signal peptide: 

amino acids 1-23 

Transmembrane domains: 

amino acids 81-100, 121-141, 173-194 
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FIGURE HQ 

GGAAAAACTGTTCTCTTCTGTGGCACAGAGAACCCTGCTTCAAAGCAGAAGTAGCAGTTCCGGAGTCC 
AGCTGGCTAAAACTCATCCCAGAGGATAATGGCAACCCATGCCTTAGAAATCGCTGGGCTGTTTCTTG 
GTGGTGTTGGAATGGTGGGCACAGTGGCTGTCACTGTCATGCCTCAGTGGAGAGTGTCGGCCTTCATT 
GAAAACAACATCGTGGTTTTTGAAAACTTCTGGGAAGGACTGTGGATGAATTGCGTGAGGCAGGCTAA 
CATCAGGATGCAGTGCAAAATCTATGATTCCCTGCTGGCTCTTTCTCCGGACCTACAGGCAGCCAGAG 
GACTGATGTGTGCTGCTTCCGTGATGTCCTTCTTGGCTTTCATGATGGCCATCCTTGGCATGAAATGC 
ACCAGGTGCACGGGGGACAATGAGAAGGTGAAGGCTCACATTCTGCTGACGGCTGGAATCATCTTCAT 
CATCACGGGCATGGTGGTGCTCATCCCTGTGAGCTGGGTTGCCAATGCCATCATCAGAGATTTCTATA 
J-f ACTCAATAGTGAATGTTGCCCAAAAACGTGAGCTTGGAGAAGCTCTCTACTTAGGATGGACCACGGCA 
CTGGTGCTGATTGTTGGAGGAGCTCTGTTCTGCTGCGTTTTTTGTTGCAACGAAAAGAGCAGTAGCTA 
|l| CAGAT ACT CGAT ACCTT CCCATCGCACAACCCAAAAAAGT TATCACACCGGAAAGAAGTCACCGAGCG 

Iff T CTACTCCAGAAGTCAGTAT GTGTAGTTGTGTATGTTTT TT TAACTTTACTATAAAGCCATGCAAAT G 

IP ACAAAAAT CTATATTACTTT CTCAAAAT GGACCCCAAAGAAACT TTGATTTACTGTT CTTAACTGCCT 

— ? AATCTTAATTACAGGAACTGT GCATCAGCTATTTAT GAT TCTATAAGCTATTT CAGCAGAATGAGATA 

TTAAACCCAATGCTTTGATTGTTCTAGAAAGTATAGTAATTTGTTTTCTAAGGTGGTTCAAGCATCTA 

tsl CTCTTTTTATCATTTACTTCAAAATGACATTGCTAAAGACTGCATTATTTTACTACTGTAATTTCTCC 

If 1 

p| ACGACATAGCATTATGTACATAGATGAGTGTAACAT TTATATCTCACATAGAGACAT GCT TATAT GGT 

fll TTTAT TTAAAATGAAATGCCAGT CCAT TACACTGAATAAATAGAACT CAACTATTGCTTTTCAGGGAA 

0 ATCAT GGATAGGGTT GAAGAAGGTTACTATTAATTGTTTAAAAACAGCT TAGGGATTAAT GTCCTCCA 

1 ll TTTAT AAT GAAGATTAAAAT GAAGGCT TTAATCAGCATTGTAAAGGAAATT GAATGGCT T TCTGATAT 

GCTGTTTTTTAGCCTAGGAGTTAGAAATCCTAACTTCTTTATCCTCTTCTCCCAGAGGCTTTTTTTTT 
C T T GT GT AT TAAAT TAACAT T T T T AAAACGCAGAT AT T T T GTCAAGGGGCT T T GCAT TCAAAC T GCT T 
TTCCAGGGCTAT ACTCAGAAGAAAGATAAAAGT GT GATCTAAGAAAAAGTGAT GGTT TTAGGAAAGT G 
AAAATAT T TTT GTTTTT GTAT TT GAAGAAGAATGATGCATT TTGACAAGAAAT CATATATGTATGGAT 
ATATTTTAATAAGTATTTGAGTACAGACTTTGAGGTTTCATCAATATAAATAAAAGAGCAGAAAAATA 
TGTCTTGGTTTT CAT T T GC T T AC CAAAAAAACAACAACAAAAAAAGT T GTC CT TTGAGAAC TT CACC T 
GCTCCTATGTGGGTACCTGAGTCAAAATTGTCATTTTTGTTCTGTGAAAAATAAATTTCCTTCTTGTA 
CCATTTCTGTTTAGTTTTACTAAAATCTGTAAATACTGTATTTTTCTGTTTATTCCAAATTTGATGAA 
ACTGACAATCCAATTTGAAAGTTTGTGTCGACGTCTGTCTAGCTTAAATGAATGTGTTCTATTTGCTT 
TATACATT TATATTAATAAATTGTACAT TTTT CTAAT T 
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FIGURE 120 

MATHALEIAGLFLGGVGMVGTVAVTVMPQ 

I YDSLLALS PDLQAARGLMCAASVMS FLAFMMAI LGMKCTRCTGDNEKVKAHILLTAGI I FI I TG 
MWL I PVSWVANAI I RDFYNS I VNVAQKRELGEALYLGWT TALVL I VGGAL FCCVFCCNEKS S S Y 
RYSIPSHRTTQKSYHTGKKSPSVYSRSQYY 

Signal peptide: 

amino acids 1-17 

Transmembrane domains: 

amino acids 82-101, 118-145, 164-168 
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FIGURE 121 

GGAGAGAGGCGCGCGGGTGAAAGGCGCATTGATGCAGCCTGCGGCGGCCTCGGAGCGCGGCGGAG 
CCAGACGCTGACCACGTTCCTCTCCTCGGTCTCCTCCGCCTCCAGCTCCGCGCTGCCCGGCAGCC 
GGGAGCCATGCGACCCCAGGGCCCCGCCGCCTCCCCGCAGCGGCTCCGCGGCCTCCTGCTGCTCC 
TGCTGCTGCAGCTGCCCGCGCCGTCGAGCGCCTCTGAGATCCCCAAGGGGAAGCAAAAGGCGCAG 
CTCCGGCAGAGGGAGGTGGTGGACCTGTATAATGGAATGTGCTTACAAGGGCCAGCAGGAGTGCC. 
TGGTGGAGACGGGAGCCCTGGGGCCAATGTTATTCCGGGTACACCTGGGATCCCAGGTCGGGATG 
GAT TCAAAGGAGAAAAGGGGGAAT GT CT GAGGGAAAGC TT TGAGGAGT CCTGGACACCCAAC TAC 
AAGCAGTGTTCATGGAGTTCATTGAATTATGGCATAGATCTTGGGAAAATTGCGGAGTGTACATT 
TACAAAGATGCGTTCAAATAGTGCTCTAAGAGTTTTGTTCAGTGGCTCACTTCGGCTAAAATGCA 
GAAAT GCATGCT GTCAGCGTT GGTAT T T CACAT T CAAT GGAGCT GAATGT T CAGGACCTC T TCCC 
AT TGAAGCTATAAT TTAT T TGGACCAAGGAAGCCC TGAAATGAAT TCAACAAT TAAT AT TCAT CG 
CACTTCTTCTGTGGAAGGACTTTGTGAAGGAATTGGTGCTGGATTAGTGGATGTTGCTATCTGGG 
TTGGCACTTGTTCAGATTACCCAAAAGGAGATGCTTCTACTGGATGGAATTCAGTTTCTCGCATC 
ATTATTGAAGAACTACCAAA ATAAA TGCTTTAATTTTCATTTGCTACCTCTTTTTTTATTATGCC 
T T GGAATGGT TCAC T TAAATGACAT T T TAAATAAGTT TATGTATACATCT GAATGAAAAGCAAAG 
CTAAATATGT T TACAGACCAAAGT GTGAT T T CACACTGT TT T TAAATCTAGCATTAT TCATT T T G 
C TTCAAT CAAAAGTGGT T T CAAT AT T T T T T T TAGT TGGT TAGAATACT T T CT TCATAGTCACATT 
CTCTCAACCTATAATTTGGAATATTGTTGTGGTCTTTTGTTTTTTCTCTTAGTATAGCATTTTTA 
AAAAAATATAAAAGCTACCAATCT T T GTACAATT T GTAAATGT TAAGAAT TT T TT TTATATC TGT 
TAAATAAAAATTATTTCCAACA 
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FIGURE 122 



MRPQGPAASPQRLRGLLLLLLLQLPAPSSASEIPKGKQiCAQLRQREWDLYNGMCLQGPAGVPGR 
DGSPGANVIPGTPGIPGRDGFKGEKGECLRESFEESWTPNYKQCSWSSLNYGIDLGKIAECTFTK 
MRSNSALRVLFSGSLRLKCRNACCQRWYFTFNGAECSGPLPIEAIIYLDQGSPEMNSTINIHRTS 
SVEGLCEGIGAGLVDVAIWVGTCSDYPKGDASTGWNSVSRI I IEELPK 

Signal peptide: 

amino acids 1-30 

Transmembrane domain: 

amino acids 195-217 
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FIGURE 122 



GCTGAGCGTGTGCGCGGTACGGGGCTCTCCTGCCTTCTGGGCTCCAACGCAGCTCTGTGGCTGAA 
CTGGGTGCTCATCACGGGAACTGCTGGGCTATGGAATACAGATGTGGCAGCTCAGGTAGCCCCAA 
AT T GC C T G G AAG AAT AC AT CAT GT T T T T CGAT AAGAAGAAAT T G TAG GAT C CAG TTTTTTTTT T A 
ACCGCCCCCTCCCCACCCCCCAAAAAAACTGTAAAGATGCAAAAACGTAATATCCATGAAGATCC 
TATTACCTAGGAAGATTTTGATGTTTTGCTGCGAATGCGGTGTTGGGATTTATTTGTTCTTGGAG 
TGTTCTGCGTGGCTGGCAAAGAATAATGTTCCAAAATCGGTCCATCTCCCAAGGGGTCCAATTTT 
TCTTCCTGGGTGTCAGCGAGCCCTGACTCACTACAGTGCAGCTGACAGGGGCTGTCATGCAACTG 
GCCCCTAAGCCAAAGCAAAAGACCTAAGGACGACCTT TGAACAATACAAAGGATGGGTT TCAATG 
TAAT TAGGCTAC T GAGCGGATCAGCTGTAGCAC TGGT TATAGCCCCCACT GT CT TACTGACAATG 
CTTTCTTCTGCCGAACGAGGATGCCCTAAGGGCTGTAGGTGTGAAGGCAAAATGGTATATTGTGA 
ATCTCAGAAATTACAGGAGATACCCTCAAGTATATCTGCTGGTTGCTTAGGTTTGTCCCTTCGCT 



O ATAACAGCCT TCAAAAACT TAAGTATAATCAATT TAAAGGGCTCAACCAGCTCACCTGGCTATAC 

Q CT T GACCATAACCATATCAGCAATAT TGACGAAAATGC T T T TAAT GGAATACGCAGACTCAAAGA 

Ifl GCT GAT TCT TAGT TCCAATAGAAT CT CCTATTTTCT TAACAATACCT TCAGACCT GTGACAAAT T 

£1 TACGGAACTTGGATCTGTCCTATAATCAGCTGCATTCTCTGGGATCTGAACAGTTTCGGGGCTTG 
f;t CGGAAGCTGCTGAGTTTACATTTACGGTCTAACTCCCTGAGAACCATCCCTGTGCGAATATTCCA 
W 1 AGACT GCCGCAACC TGGAACTT T T GGACCT GGGATATAACCGGAT CCGAAGT T TAGCCAGGAAT G 

Cp TCT TT GCTGGCATGATCAGACTCAAAGAACTTCACCTGGAGCACAATCAATT TTCCAAGCTCAAC 

In CTGGCCCTTTTTCCAAGGTTGGTCAGCCTTCAGAACCTTTACTTGCAGTGGAATAAAATCAGTGT 
CATAGGACAGAC CATGTCC TGGACCTGGAGCTCCT TACAAAGGCT T GAT T TAT CAGGC AAT GAGA 
JL TCGAAGCTTTCAGTGGACCCAGTGTTTTCCAGTGTGTCCCGAATCTGCAGCGCCTCAACCTGGAT 
W TCCAACAAGCTCAC AT T TATTGGTCAAGAGAT T T TGGAT TC T T GGATAT CCCTCAATGACAT CAG 

III TCTTGCTGGGAATATATGGGAATGCAGCAGAAATATTTGCTCCCTTGTAAACTGGCTGAAAAGTT 
f: T TAAAGGT CTAAGGGAGAATACAAT TAT CT GTGCCAGTCCCAAAGAGCT GCAAGGAGTAAAT GTG 

|s s AT CGAT GCAGTGAAGAACTACAGCAT CT GT GGCAAAAGTACTACAGAGAGGT T TGATCTGGCCAG 

GGCTCTCCCAAAGCCGACGTTTAAGCCCAAGCTCCCCAGGCCGAAGCATGAGAGCAAACCCCCTT 
*i? TGCCCCCGACGGTGGGAGCCACAGAGCCCGGCCCAGAGACCGATGCTGACGCCGAGCACATCTCT 
RJ TTCCATAAAATCATCGCGGGCAGCGTGGCGCTTTTCCTGTCCGTGCTCGTCATCCTGCTGGTTAT 
CTACGTGTCATGGAAGCGGTACCCTGCGAGCATGAAGCAGCTGCAGCAGCGCTCCCTCATGCGAA 
GGCACAGGAAAAAGAAAAGACAGTCCCTAAAGCAAATGACTCCCAGCACCCAGGAATTTTATGTA 
GAT TATAAACCCACCAACACGGAGACCAGCGAGATGC TGCTGAAT GGGACGGGACCC TGCACC TA 
TAACAAATCGGGC TCCAGGGAGTGTGAGGT ATGAACCAT TGT GATAAAAAGAGCTC T TAAAAGCT 
GGGAAATAAGTGGTGCTTTATTGAACTCTGGTGACTATCAAGGGAACGCGATGCCCCCCCTCCCC 
TTCCCTCTCCCTCTCACTTTGGTGGCAAGATCCTTCCTTGTCCGTTTTAGTGCATTCATAATACT 
GGTCAT TTTCCTCTCATACATAATCAACCCATTGAAATT TAAATACCACAATCAATGTGAAGCT T 
GAACTCCGGT TTAATATAATACC TATT GTAT AAGACCCT TTACTGAT T CCAT TAATGTCGCAT T T 
GT T T TAAGATAAAACT TC T T TCATAGGTAAAAAAAAAAA 
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FIGURE 124 

MGFNVIRLLSGSAVALVIAPTVLLTMLSSAERGCPKGCRCEGKMVYCESQKLQEIPSSISAGCLG 
LSLRYNSLQKLKYNQFKGLNQLTWLYLDHNHISNIDENAFNGIRRLKELILSSNRISYFLNNTFR 
PVTNLRNLDLS YNQLHSLGS EQ FRGLRKLLS LHLRSNS LRT I PVRI FQDCRNLELLDLG YNRI RS 
LARWFAGMIRLKELHLEHNQFSKLNI^ 

SGNEIEAFSGPSVFQCVPNLQRLNLDSNKLTFIGQEILDSWISLNDISLAGNIWECSRNICSLVN 
WLKSFKGLRENTI ICASPKELQGWVIDAVKNYS ICGKSTTERFDLARALPKPTFKPKLPRPKHE 
SKPPLPPTVGATEPGPETDADAEHISFHKIIAGSVALFLSVLVILLVIYVSWKRYPASMKQLQQR 
SLMRRHRKKKRQSLKQMTPSTQEFYVDYKPTNTETSEMLLNGTGPCTYNKSGSRECEV 

Important features of the protein: 
Signal peptide: 

amino acids 1-33 

Transmembrane domain: 

amino acids 420-442 

N-glycosylation sites. 

amino acids 126-129, 357-360, 496-499, 504-507 

cAMP- and cGMP-dependent protein kinase phosphorylation site. 

amino acids 465-468 

Tyrosine kinase phosphorylation site. 

amino acids 136-142 

N-myristoylation sites. 

amino acids 11-16, 33-38, 245-250, 332-337, 497-502, 507-512 
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FIGURE 12s 

CCGTTATCGTCTTGCGCTACTGCTGAATGTCCGTCCCGGAGGAGGAGGAGAGGCTTTTGCCGCTG 
ACCCAGAGATGGCCCCGAGCGAGCAAATTCCTACTGTCCGGCTGGGCGGCTACCGTGGCCGAGCT 
AGCAACCT T TCCCCTGGATCTCACAAAAAC T CGACT CCAAATGCAAGGAGAAGCAGCTC T TGC TC 
GGTTGGGAGACGGTGCAAGAGAATCTGCCCCCTATAGGGGAATGGTGCGCACAGCCCTAGGGATC 
AT TGAAGAG GAAGGC T T T C TAAAG C T T T GG CAAGGAG T GAC AC C C GC CAT T T ACAGACAC G TAG T 
GTATTCTGGAGGTCGAATGGTCACATATGAACATCTCCGAGAGGTTGTGTTTGGCAAAAGTGAAG 
ATGAGCATTATCCCCTTTGGAAATCAGTCATTGGAGGGATGATGGCTGGTGTTATTGGCCAGTTT 
C3 T TAGCCAAT CCAAC TGACC TAGTGAAGGT T CAGATGCAAATGGAAGGAAAAAGGAAAC TGGAAGG 

y AAAACCAT T GCGAT T TCGTGGT GTACATCAT GCAT TTGCAAAAAT CT TAGCT GAAGGAGGAATAC 

fj GAGGGCT TT GGGCAGGC TGGGTACCCAATATACAAAGAGCAGCACTGGT GAATATGGGAGAT T TA 

Iff AC CACT TAT GATACAGTGAAACACTACT TGGTAT T GAATACACCAC TT GAGGACAATATCAT GAC 

fl TCACGGTTTATCAAGTTTATGTTCTGGACTGGTAGCTTCTATTCTGGGAACACCAGCCGATGTCA 
tfl TCAAAAGCAGAATAATGAATCAACCACGAGATAAACAAGGAAGGGGACTTTTGTATAAATCATCG 
* ACTGACTGCT TGATTCAGGC TGT T CAAGGT GAAGGAT TCATGAGTC TATATAAAGGC T T T T TACC 

f ; - : :; 

sJJ ATCTTGGCTGAGAATGACCCCTTGGTCAATGGTGTTCTGGCTTACTTATGAAAAAATCAGAGAGA 

PI 

p T GAGT GGAGT CAGTCCAT T T TAA 

rj| 

III 
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FIGURE 126 



MSVPEEEERLLPLTQRWPRASKFLLSGCAATVAELATFPLDLTKTRLQMQGRAALARLGDGARES 
APYRGMVRTALGIIEEEGFLKLWQGVTPAIYRHWYSGGRMVTYEHLREWFGKSEDEHYPLWKS 
VI GGMMAGVI GQFLANP TDLVKVQMQMEGKRKLEGKPLRFRGVHHAFAKI LAEGGI RGLWAGWVP 
NIQRAALVNMGDLTTYDTVKHYLVLNTPLEDNIMTHGLSSLCSGLVASILGTPADVIKSRIMNQP 
RDKQGRGLLYKSSTDCLIQAVQGEGFMSLYKGFLPSWLRMTPWSMVFWLTYEKIREMSGVSPF 

Transmembrane domains: 

amino acids 25-38, 130-147, 233-248 
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FIGURE 127 



o 



CGCGGATCGGACCCAAGCAGGTCGGCGGCGGCGGCAGGAGAGCGGCCGGGCGTCAGCTCCTCGAC 
CCCCGTGTCGGGCTAGTCCAGCGAGGCGGACGGGCGGCGTGGGCCCATGGCCAGGCCCGGCATGG 
AGCGGTGGCGCGACCGGCTGGCGCTGGTGACGGGGGCCTCGGGGGGCATCGGCGCGGCCGTGGCC 
CGGGCCCTGGTCCAGCAGGGACTGAAGGTGGTGGGCTGCGCCCGCACTGTGGGCAACATCGAGGA 
GCTGGCTGCTGAATGTAAGAGTGCAGGCTACCCCGGGACTTTGATCCCCTACAGATGTGACCTAT 
CAAAT GAAGAGGACATCCT C TCCATGTT CT CAGC TAT CCGT TCT CAGCACAGCGGT GTAGACATC 
TGCATCAACAATGCTGGCTTGGCCCGGCCTGACACCCTGCTCTCAGGCAGCACCAGTGGTTGGAA 
GGACATGTTCAATGTGAACGTGCTGGCCCTCAGCATCTGCACACGGGAAGCCTACCAGTCCATGA 
AGGAGCGGAATGTGGACGATGGGCACATCATTAACATCAATAGCATGTCTGGCCACCGAGTGTTA 
r l CCCCTGTCTGTGACCCACTTCTATAGTGCCACCAAGTATGCCGTCACTGCGCTGACAGAGGGACT 
||I GAGGCAAGAGCTTCGGGAGGCCCAGACCCACATCCGAGCCACGTGCATCTCTCCAGGTGTGGTGG 
ff! AGACACAAT T CGCC T T CAAACTCCACGACAAGGACCC TGAGAAGGCAGCT GCCACC TAT GAGCAA 

*W ATGAAGTGTCTCAAACCCGAGGATGTGGCCGAGGCTGTTATCTACGTCCTCAGCACCCCCGCACA 
CAT CCAGAT T GGAGACATCCAGAT GAGGCCCACGGAGCAGGTGACC TAG T GAC TGT GGGAGC TCC 
iff TCCTTCCCTCCCCACCCTTCATGGCTTGCCTCCTGCCTCTGGATTTTAGGTGTTGATTTCTGGAT 
Q CACGGGATACCAC T TCCTGTCCACACCCCGACCAGGGGC TAGAAAAT T T GTT TGAGATT T TTATA 

111 TCATCTTGTCAAATTGCTTCAGTTGTAAATGTGAAAAATGGGCTGGGGAAAGGAGGTGGTGTCCC 

|J TAATTGTTTTACTTGTTAACTTGTTCTTGTGCCCCTGGGCACTTGGCCTTTGTCTGCTCTCAGTG 

I'l ! 

* * r TCTTCCCTTTGACATGGGAAAGGAGTTGTGGCCAAAATCCCCATCTTCTTGCACCTCAACGTCTG 
TGGCTCAGGGCTGGGGTGGCAGAGGGAGGCCTTCACCTTATATCTGTGTTGTTATCCAGGGCTCC 
AGACTTCCTCCTCTGCCTGCCCCACTGCACCCTCTCCCCCTTATCTATCTCCTTCTCGGCTCCCC 
AGCCCAGTCTTGGCTTCTTGTCCCCTCCTGGGGTCATCCCTCCACTCTGACTCTGACTATGGCAG 
CAGAACACCAGGGCC TGGCCCAGTGGATTT CATGGT GAT CAT T AAAAAAGAAAAATCGCAACCAA 
AAAAAAAAAA 
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FIGURE 128 

MARPGMERWRDRLALVTGASGGIGAAVARALVQQGLKWGCARTVGNIEEIAMCKSAGYPGTLI 
PYRCDLSNEEDILSMFSAIRSQHSGVDICINNAGIARPDTLLSGST^ 

EAYQSMKERNVDDGHIININSMSGHRVLPLSVTHFYSATKYAVTALTEGLRQELREAQTHIRATC 
ISPGWETQFAFKLHDKDPEKAAATYEQMKCLKPEDVAEAVIYVLSTPAHIQIGDIQMRPTEQVT 

Important features of the protein: 
Signal peptide: 

amino acids 1-17 

N-myristoylation sites. 

amino acids 18-24, 21-27, 22-28, 24-30, 40-46, 90-96, 109-115, 
199-205 

Short-chain alcohol dehyrogenase . 

amino acids 30-42, 104-114 
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FIGURE 12Q 



AACTTCTAC ATGG GCCTCCTGCTGCTGGTGCTCTTCCTCAGCCTCCTGCCGGTGGCCTACACCAT 
CATGTCCCTCCCACCCTCCTTTGACTGCGGGCCGTTCAGGTGCAGAGTCTCAGTTGCCCGGGAGC 
ACCTCCCCTCCCGAGGCAGTCTGCTCAGAGGGCCTCGGCCCAGAATTCCAGTTCTGGTTTCATGC 
CAGCCTGTAAAAGGCCATGGAAC T T T GGGT GAATCACCGAT GCCAT T TAAGAGGGT T TTC TGCCA 
GGATGGAAATGTTAGGTCGTTCTGTGTCTGCGCTGTTCATTTCAGTAGCCACCAGCCACCTGTGG 
CCGT T GAGT GCT TGAAATGAGGAAC T GAGAAAAT TAAT T TC TCAT GTAT T TT TCTCAT T TAT T TA 
TTAATT TT TAACTGATAGTTGTACATAT TTGGGGGTACATGTGATATTTGGATACATGTATACAA 
TATATAATGATCAAATCAGGGTAACTGGGATATCCATCACATCAAACATT TATTTT TTAT TCT TT 
T TAGACAGAGTC TCAC TCTGT CACCCAGGCT GGAGTGCAGTGGTGCCAT C T CAGC T TACTGCAAC 
CTCTGCCTGCCAGGTTCAAGCGATTCTCATGCCTCCACCTCCCAAGTAGCTGGGAGTACAGGCAT 
GCACCACAATGCCCAACTAATTTTTGTATTTTTAGTAGAGACGGGGTTTTGCCATGTTGCCCAGG 
CTGGCCTTGAACTCCTGGCCTCAAACAATCCACTTGCCTCGGCCTCCCAAAGTGTTATGATTACA 
GGCGTGAGCCACCGTGCCTGGCCTAAACATTTATCTTTTCTTTGTGTTGGGAACTTTGAAATTAT 
ACAATGAATTATTGTTAACTGTCATCTCCCTGCTGTGCTATGGAACACTGGGACTTCTTCCCTCT 
ATCTAACTGTATATTTGTACCAGTTAACCAACCGTACTTCATCCCCACTCCTCTCTATCCTTCCC 
AACCTCTGATCACCTCATTCTACTCTCTACCTCCATGAGATCCACTTTTTTAGCTCCCACATGTG 
AGTAAGAAAATGCAATATTTGTCTTTCTGTGCCTGGCTTATTTCACTTAACATAATGACTTCCTG 
T T CCAT CCAT GT TGC T GCAAATGACAGGAT T TCGT TC T TAATTT CAAT TAAAATAACCACACAT G 
GCAAAAA 
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FIGURE ISO 

MGLLLLVLFLSLLPVAYTIMSLPPSFDCGPFRCRVSVAREHLPSRGSLLRGPRPRIPVLVSCQPV 
KGHGTLGESPMPFKRVFCQDGNVRSFCVCAVHFSSHQPPVAVECLK 

Important features of the protein: 
Signal peptide: 

amino acids 1-18 

N-myristoylation site. 

amino acids 86-92 

Zinc carboxypeptidases , zinc-binding region 2 signature. 

amino acids 68-79 
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FIGURE 131 



01 
h 



m 



TTCTGAAGTAACGGAAGCTACCTTGTATAAAGACCTCAACACTGCTGACCATGATCAGCGCAGCCTGGAGC 
ATCTTCCTCATCGGGACTAAAATTGGGCTGTTCCTTCAAGTAGCACCTCTATCAGTTATGGCTAAATCCTG 
TCCATCTGTGTGTCGCTGCGATGCGGGTTTCATTTACTGTAATGATCGCTTTCTGACATCCATTCCAACAG 
GAAT ACC AG AGGAT GC TACAACT CTCTACCTT C AGAACAACC AAAT AAAT AATG CTGGGAT TCCTT C AGAT 
TT GAAAAACT TGCT GAAAGT AG AAAGAAT ATACCT ATACC ACAACAGTTT AGAT GAATTTCCT ACC AACC T 
CCCAAAGT AT GT AAAAGAGT T ACAT T TGCAAG AAAATAACAT AAGGACT ATG AC TTAT GATT CACT TTCAA 
AAAT TCCCTATCTGGAAGAATT AC AT TTAGAT GACAACTCTGTCTCTGC AGT T AGC AT AGAAGAGGGAGC A 
TTCCGAGACAGCAACTATCTCCGACTGCTTTTCCTGTCCCGTAATCACCTTAGCACAATTCCCTGGGGTTT 
GC CC AGGACT AT AGAAGAACT ACGCT TGGATGAT AAT CGCAT ATCC ACT ATT T CAT C ACC AT CT CT TC AAG 
GT CT CACT AGTCTAAAACGC CTGGTT CT AG ATGGAAACCTGTTGAACAAT CATGGT T TAGGTGACAAAGT T 
TTCTTCAACCTAGTTAATTTGACAGAGCTGTCCCTGGTGCGGAATTCCCTGACTGCTGCACCAGTAAACCT 
TCCAGGCACAAACCTGAGGAAGCTT T AT CTTCAAGAT AAC CACATC AATC GGGT GCCCCCAAATGCTTTTT 
CT TAT C TAAGGC AGCT CT AT CG ACT GGAT ATGTCCAAT AATAACCT AAGT AATT T ACC TC AGGGT ATCTTT 
GATGAT TT GGAC AATATAAC AC AACT GAT TCTTCGC AACAATCCCT GGTATTGCGGGT GC AAGATGAAAT G 
GGTACGTGAC TGGT TAC AAT CACT ACCT GTGAAGGTC AACGTGCGTGGGCTCATGTGCCAAGCCCC AGAAA 
AGGT TCGTGGGAT GGCT AT T AAGGATCT C AAT GC AGAACTGTTT GATTGT AAGGAC AGTGGGAT TGT AAGC 
ACCATTCAGATAACCACTGCAATACCCAACACAGTGTATCCTGCCCAAGGACAGTGGCCAGCTCCAGTGAC 
CAAACAGCCAGATATTAAGAACCCCAAGCTCACTAAGGATCAACAAACCACAGGGAGTCCCTCAAGAAAAA 
CAAT TAC AAT TACT GT GAAGTCTGTCACCTCT GATACCATTC AT ATCTCT TGGAAACT TGCT CT ACCT AT G 
ACTGCTTT GAGACT CAGCTGGCTTAAACTGGGCC ATAGCCCGGC AT TT GGAT CT AT AACAGAAACAAT TGT 
AACAGGGGAACGC AGTGAGT ACTTGGTC AC AGCC CT GGAGCCTG AT TCAC CCT AT AAAGT AT GC AT GGT T C 
CCAT GGAAAC CAGC AACCTC T ACCTATT TGAT GAAACT CC TGT T TGT AT T GAGACTGAAACT GC ACCC CT T 
CGAAT GT ACAAC CCT ACAAC CACCCTCAATCGAG AGCAAGAGAAAGAACC TT AC AAAAAC CCCAAT TT AC C 
TTTGGCTGCCATCATTGGTGGGGCTGTGGCCCTGGTTACCATTGCCCTTCTTGCTTTAGTGTGTTGGTATG 
TT CAT AGG AATGGATC GCTCTT CTCAAGGAACTGT GCATAT AGC AAAGGG AGGAGAAGAAAGGAT GACTAT 
GCAGAAGCT GGCACT AAG AAGGAC AACTCT AT CCTGGAAATC AGGGAAAC TTCTTT T C AGATGT TACC AAT 
AAGCAATGAACCCATCTCGAAGGAGGAGTTTGTAATACACACCATATTTCCTCCTAATGGAATGAATCTGT 
AC AAAAAC AATCACAGTGAAAGCAGT AGT AACCGAAGCTACAGAGACAGT GGTATT CC AGACTC AGAT C AC 
T C ACACTC ATGATGCT GAAGGACTCACAGCAGACTTGT GT TTT GGGTTT T TT AAACCT AAGGGAGGTGAT G 
GT 
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FIGURE 132 



MI SAAWS I FL IGTKI GLFLQVAPLSVMAKSCPS VCRCDAGFI YCNDRFLTS I PTGI PEDATTLYL 
QNNQINNAGIPSDLKNLLKVERIYLYHNSLDEFPTNLPKYVKELHLQENNIRTITYDSLSKIPYL 
EELHLDDNS VS AVS I EEGAFRDSNYLRLLFLSRNHLS T I PWGLPRT I EELRLDDNRI STISSPSL 
QGLTSLKRLVLDGNLLNNHGLGDKVFFNLVNLTELSLYRNSLTAAPVNLPGTNLRKLYLQDNHIN 
RVPPNAFS YLRQL YRLDMSNNNLSNLPQG I FDDLDNI TQL I LRNNPWYCGCKMKWVRDWLQSLPV 
KVNVRGLMCQAPEKVRGMAIKDLNAELFDCKDSGIVST IQI TTAIPNTVYPAQGQWPAPVTKQPD 
IKNPKLTKDQQTTGS PSRKT I T I TVKS VTS DT I HI SWKLALPMT ALRLS WLKLGHS PAFGS I TET 
IVTGERSEYLVTALEPDSPYKVCMVPMETSNLYLFDETPVCIETETAPLRMYNPTTTLNREQEKE 
f : i PYKNPNLPLAAI I GGAVALVT I ALLALVCWYVHRNGSLFSRNCAYSKGRRRKDDYAEAGTKKDNS 

Ifl ILEIRETSFQMLPISNEPISKEEFVIHTIFPPNGMNLYKNNHSESSSNRSYRDSGIPDSDHSHS 



Ill 



Important features of the protein: 
Signal peptide: 

amino acids 1-28 
Transmembrane domain: 
amino acids 531-552 
jjTj N-glycosylation sites. 

f»i amino acids 226-229, 282-285, 296-299, 555-558, 626-629, 633-636 

f|| Tyrosine kinase phosphorylation site. 

amino acids 515-522 

N-myristoylation sites. 

amino acids 12-17, 172-177, 208-213, 359-364, 534-539, 556-561, 
640-645 

Amidation site. 

amino acids 567-570 
Leucine zipper pattern. 

amino acids 159-180 

Phospholipase A2 aspartic acid active site. 

amino acids 34-44 
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FIGURE taa 



CCGTCATCCCCCTGCAGCCACCCTTCCCAGAGTCCTTTGCCCAGGCCACCCCAGGCTTCTTGGCA 
GCCCTGCCGGGCCACTTGTCTTCATGTCTGCCAGGGGGAGGTGGGAAGGAGGTGGGAGGAGGGCG 
TGCAGAGGCAGTCTGGGCTTGGCCAGAGCTCAGGGTGCTGAGCGTGTGACCAGCAGTGAGCAGAG 
GCCGGCCATGGCCAGCCTGGGGCTGCTGCTCCTGCTCTTACTGACAGCACTGCCACCGCTGTGGT 
CCTCCTCACTGCCTGGGCTGGACACTGCTGAAAGTAAAGCCACCATTGCAGACCTGATCCTGTCT 
GCGCTGGAGAGAGCCACCGTCTTCCTAGAACAGAGGCTGCCTGAAATCAACCTGGATGGCATGGT 
3 , GGGGGTCCGAGTGCTGGAAGAGCAGCTAAAAAGTGTCCGGGAGAAGTGGGCCCAGGAGCCCCTGC 
H TGCAGCCGCTGAGCCTGCGCGTGGGGATGCTGGGGGAGAAGCTGGAGGCTGCCATCCAGAGATCC 
C| CTCCACTACCTCAAGC TGAGT GAT CCCAAGTACC TAAGAGAGT TCCAGCT GACCCT CCAGCCCGG 

GTTTTGGAAGCTCCCACATGCCTGGATCCACACTGATGCCTCCTTGGTGTACCCCACGTTCGGGC 
Jp CCCAGGACTCATTCTCAGAGGAGAGAAGTGACGTGTGCCTGGTGCAGCTGCTGGGAACCGGGACG 
jj| GACAGCAGCGAGCCCTGCGGCCTCTCAGACCTCTGCAGGAGCCTCATGACCAAGCCCGGCTGCTC 
i,ri AGGCTACTGCCTGTCCCACCAACTGCTCTTCTTCCTCTGGGCCAGAATGAGGGGATGCACACAGG 
a GACCACTCCAACAGAGCCAGGAC TATATCAACCTC TT C TGCGCCAACAT GAT GGAC T TGAACCGC 

Jj| AGAGCTGAGGCCATCGGATACGCCTACCCTACCCGGGACATCTTCATGGAAAACATCATGTTCTG 
Ji{ TGGAATGGGCGGCTTCTCCGACTTCTACAAGCTCCGGTGGCTGGAGGCCATTCTCAGCTGGCAGA 
?t! AACAGCAGGAAGGATGC TTCGGGGAGCCTGATGC T GAAGATGAAGAAT TATC TAAAGCTAT TCAA 

|3 TAT CAGCAGCAT T T T TCGAGGAGAGTGAAGAGGCGAGAAAAACAAT T TCCAGAT TCTCGC TCTGT 

til TGCTCAGGCTGGAGTACAGTGGCGCAATCTCGGCTCACTGCAACCTTTGCCTCCTGGGTTCAAGC 
AATTCTCTTGCCTCATCCTCCCGAGTAGCTGGGACTACAGGAGCGTGCCACCATACCTGGCTAAT 
T T T TATATT T TT T TAGTAGAGACAGGGT T TCATGATGT TGC TCAT GCT GGTC TCGAACT CC TGAT 
CTCAAGAGATCCGCCCACCTCAGGCTCCCAAAGTGTGGGATTATAGGTGTGAGCCACCGTGTCTG 
GCT GAAAAGCAC T T TCAAAGAGAC T GT GT TGAATAAAGGGCCAAGGT TCT TGCCACCCAGCAC T C 
ATGGGGGCTCTCTCCCCTAGATGGCTGCTCCTCCCACAACACAGCCACAGCAGTGGCAGCCCTGG 
GT GGCT T CCT ATACATCCTGGCAGAATACCCCCCAGCAAACAGAGAGCCACACCCATCCACACCG 
CCACCACCAAGCAGCCGCTGAGACGGACGGTTCCATGCCAGCTGCCTGGAGGAGGAACAGACCCC 
TT TAGT CCTCATCCCT TAGATCCTGGAGGGCACGGATCACATCCTGGGAAGAAGGCATC TGGAGG 
ATAAGCAAAGCCACCCCGACACCCAATCTTGGAAGCCCTGAGTAGGCAGGGCCAGGGTAGGTGGG 
GGCCGGGAGGGACCCAGGTGTGAACGGATGAATAAAGTTCAACTGCAACTGAAAAAAAAAAA 
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FIGURE 13d 



MSARGRWEGGGRRACRGSLGLARAQGAERVTSSEQRPAMASLGLLLLLLLTALPPLWSSSLPGLD 
TAESKATIADLILSALERATVFLEQRLPEINLDGMVGVRVLEEQLKSVREKWAQEPLLQPLSLRV 
GMLGEKLEAAIQRSLHYLKLSDPKYLREFQLTLQPGFWKLPHAWIHTDASLVYPTFGPQDSFSEE 
RSDVCLVQLLGTGTDSSEPCGLSDLCRSLMTKPGCSGYCLSHQLLFFLWARMRGCTQGPLQQSQD 
YINLFCAISiMMDLNRRAEAIGYAYPTRDIFMENIMFCGMGGFSDFYKLRWLEAILSWQKQQEGCFG 
EPDAEDEELSKAIQYQQHFSRRVKRREKQFPDSRSVAQAGVQWRNLGSLQPLP PGFKQFS CL ILP 
SSWDYRSVPPYLANFYIFLVETGFHHVAHAGLELLISRDPPTSGSQSVGL 



III Important features of the protein: 

if 5 Signal peptide: 

JZl amino acids 1-26 



Transmembrane domain: 

amino acids 39-56 

Tyrosine kinase phosphorylation sites. 



m amino acids 149-156, 274-282 

fll N-myristoylation sites. 

amino acids 10-16, 20-26, 63-69, 208-214 



Amidation site. 

amino acids 10-14 

Glycoprotein hormones beta chain signature 1 . 

amino acids 230-237 
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FIGURE 135 

GGTCTGAGTGCAGAGCTGCTGTC ATGG CGGCCGCTCTGTGGGGCTTCTTTCCCGTCCTGCTGCTG 
CTGCTGCTATCGGGGGATGTCCAGAGCTCGGAGGTGCCCGGGGCTGCTGCTGAGGGATCGGGAGG 
GAGTGGGGTCGGCATAGGAGATCGCTTCAAGATTGAGGGGCGTGCAGTTGTTCCAGGGGTGAAGC 
CTCAGGACTGGATCTCGGCGGCCCGAGTGCTGGTAGACGGAGAAGAGCACGTCGGTTTCCTTAAG 
ACAGATGGGAGTTTTGTGGTTCATGATATACCTTCTGGATCTTATGTAGTGGAAGTTGTATCTCC 
AGCTTACAGATT TGATCCCGTTCGAGTGGATATCACT TCGAAAGGAAAAATGAGAGCAAGATAT G 
TGAATTACATCAAAACATCAGAGGTTGTCAGACTGCCCTATCCTCTCCAAATGAAATCTTCAGGT 
CCACCT T CT TACT T TAT TAAAAGGGAATCGT GGGGCT GGACAGAC TTTCTAATGAACCCAAT GGT 
TATGATGATGGTTCTTCCTTTATTGATATTTGTGCTTCTGCCTAAAGTGGTCAACACAAGTGATC 
CT GACAT GAGACGGGAAATGGAGCAGT CAAT GAATATGC TGAATT CCAACCAT GAGTTGCCTGAT 
GTTTCTGAGTTCATGACAAGACTCTTCTCTTCAAAATCATCTGGCAAATCTAGCAGCGGCAGCAG 
TAAAACAGGCAAAAGTGGGGCTGGCAAAAGGAGGTAGTCAGGCCGTCCAGAGCTGGCATTTGCAC 
AAACACGGCAACAC TGGGTGGCAT CCAAGT CT TGGAAAACCGT GT GAAGCAACTAC TATAAACT T 
GAGTCATCCCGACGTTGATCTCTTACAACTGTGTATGTT 

AACTTTTTAGCACATGTTTTGTACTTGGTACACGAGAAAACCCAGCTTTCATCTTTTGTCTGTAT 
GAGGT CAATAT TGATGT CAC T GAAT TAATTACAGT GT CCTATAGAAAAT GCCAT TAATAAAT TAT 
AT GAACTACTATACAT TATGTATAT T AAT T AAAACATC T TAAT CCAGAAATCAAAAAAAAAAAAA 
AAAAAAAAAAAAAAA 
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FIGURE 136 

MAAALWGFFPVLLLLLLSGDVQSSEVPGAAAEGSGGSGVGIGDRFKIEGRAWPGVKPQDWISAA 
RVLVDGEEHVGFLKTDGS FWHDI PSGS YWEWS PAYRFDPVRVDI TSKGKMRARYVNYIKTSE 
VVRLPYPLQMKSSGPPSYFIKRESWGWTDFLMNPMVMMMVX^ 
QSMNMLNSNHELPDVSEFMTRLFSSKSSGKSSSGSSKTGKSGAGKRR 



Important features of the protein: 
Signal sequence: 

i*\ amino acids 1-23 

; ;;;{;? 

ill Transmembrane domain: 

amino acids 161-182 

if s 

m 

i; f| N-glycosylation site. 



amino acids 184-187 

Glycosaminoglycan attachment sites. 

amino acids 37-40, 236-239 



ill 

II i 

pj cAMP- and cGMP-dependent protein kinase phosphorylation site. 

amino acids 151-154 

N-myristoylation sites. 

amino acids 33-38, 36-41, 38-44, 229-234 

Amidation site. 

amino acids 238-241 

ATP/GTP-binding site motif A (P-loop) . 

amino acids 229-236 
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FIGURE 137 



GATGGCGCAGCCACAGCTTCTGTGAGATTCGATTTCTCCCCAGTTCCCCTGTGGGTCTGAGGGGA 
CCAGAAGGGTGAGCTACGTTGGCTTTCTGGAAGGGGAGGCTATATGCGTCAATTCCCCAAAACAA 
GTTTTGACATTTCCCCTGAAATGTCATTCTCTATCTATTCACTGCAAGTGCCTGCTGTTCCAGGC 
CTTACCTGCTGGGCACTAACGGCGGAGCCAGGATGGGGACAGAATAAAGGAGCCACGACCTGTGC 
CACCAACTCGCACTCAGACTCTGAACTCAGACCTGAAATCTTCTCTTCACGGGAGGCTTGGCAGT 
TTTTCTTACTCCTGTGGTCTCCAGATTTCAGGCCTAAGATGAAAGCCTCTAGTCTTGCCTTCAGC 
CTTCTCTCTGCTGCGTTTTATCTCCTATGGACTCCTTCCACTGGACTGAAGACACTCAATTTGGG 
AAGCTGTGTGATCGCCACAAACCTTCAGGAAATACGAAATGGATTTTCTGAGATACGGGGCAGTG 
TGCAAGCCAAAGATGGAAACAT TGACAT CAGAAT C T TAAGGAGGAC T GAGTC T TTGCAAGACACA 
||1 AAGCCTGCGAATCGATGCTGCCTCCTGCGCCATTTGCTAAGACTCTATCTGGACAGGGTATTTAA 
PI AAACTACCAGACCCCT GACCAT TATACT CT CCGGAAGATCAGCAGCC T CGCCAAT TCCT TTCT TA 

CCATCAAGAAGGACCTCCGGCTCTCTCATGCCCACATGACATGCCATTGTGGGGAGGAAGCAATG 
AAGAAATACAGCCAGAT TCTGAGT CACT TT GAAAAGC TGGAACCT CAGGCAGCAGTTGTGAAGGC 
T T T GGGGGAACTAGACATTCT TCT GCAATGGATGGAGGAGACAGAATAGGAGGAAAGTGATGCTG 
CTGCTAAGAATAT T CGAGGT CAAGAGCT CCAGTC T TCAATACCTGCAGAGGAGGCATGACCCCAA 
ACCACCATCTCTTTACTGTACTAGTCTTGTGCTGGTCACAGTGTATCTTATTTATGCATTACTTG 
CT T CCT TGCATGAT TGT CT T TATGCAT CCCCAATC T TAATTGAGACCATACT TGTATAAGATT T T 
TGTAATAT CT TTCTGCTATTGGATATAT TTATTAGTTAATATATTTATT TATT TT TTGCTATT TA 
ATGTAT T T AT TT T T T TAC T TGGACAT GAAACT TTAAAAAAAT T CACAGAT TATAT T TATAACC T G 
AC TAGAGCAGGT GATGTAT T TT TATACAGTAAAAAAAAAAAACC T T GTAAAT TCTAGAAGAGTGG 
CTAGGGGGGT TAT TCAT T TGTAT T CAAC TAAGGACATATTTACT CATGCTGAT GC TC TGTGAGAT 
AT T TGAAAT T GAACCAATGACTACT TAGGAT GGG T TGTGGAATAAGTT T T GATGTGGAAT TGCAC 
ATC TACCTTACAAT TAC TGACCAT CCCCAGTAGAC TCCCCAGTCCCATAAT T GTGTATC T TCCAG 
CCAGGAATCCTACACGGCCAGCATGTAT TTC TACAAATAAAGTT T T CT T TGCATACCAAAAAAAA 
AAAAAAAAAAA 



o 

IF I 
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FIGURE 138 

MRQFPKTS FDI S PEMS FS I YSLQVPAVPGLTCWALTAEPGWGQNKGATTCATNSHSDSELRPE I F 
SSREAWQFFLLLWSPDFRPKMKASSIAFSLLSAAFYLLWTPSTGLKTLNLGSCVIATNLQEIRNG 
FSEIRGSVQAKDGNIDIRILRRTESLQDTKPANRCCLLRHLLRLYLDRVFKNYQTPDHYTLRKIS 
SIANSFLTIKKDLRLSHAHMTCHCGEEAMKKYSQILSHFEKLEPQAAVVKALGELDILLQWMEET 
E 

Important features of the protein: 
:f ^ : Signal peptide: 

f'i amino acids 1-42 

m 

cAMP- and cGMP-dependent protein kinase phosphorylation sites. 

3.3'js 

!j| amino acids 192-195, 225-228 

9 N-myristoylation sites. 

Q amino acids 42-47, 46-51, 136-141 

||| 

1st 

III 
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FIGURE 13Q 



CCTGGAGCCGGAAGCGCGGCTGCAGCAGGGCGAGGCTCCAGGTGGGGTCGGTTCCGCATCCAGCC 
TAGCGTGTCCACGATGCGGCTGGGCTCCGGGACTTTCGCTACCTGTTGCGTAGCGATCGAGGTGC 
TAGGGATCGCGGTCTTCCTTCGGGGATTCTTCCCGGCTCCCGTTCGTTCCTCTGCCAGAGCGGAA 
CACGGAGCGGAGCCCCCAGCGCCCGAACCCTCGGCTGGAGCCAGTTCTAACTGGACCACGCTGCC 
ACCACCTCTCTTCAGTAAAGTTGTTATTGTTCTGATAGATGCCTTGAGAGATGATTTTGTGTTTG 
GGTCAAAGGGTGTGAAAT T TAT GCCCTACACAAC T TACC TTGT GGAAAAAGGAGCATCT CACAGT 
TT TGTGGCTGAAGCAAAGCCACCTACAGTTAC TATGCC TCGAATCAAGGCATTGATGACGGGGAG 
CCTTCCTGGCTTTGTCGACGTCATCAGGAACCTCAATTCTCCTGCACTGCTGGAAGACAGTGTGA 
TAAGACAAGCAAAAGCAGCTGGAAAAAGAATAGTCT TT TATGGAGATGAAACCTGGGTTAAA^ TA 
TTCCCAAAGCATT T TG TGGAATAT GATGGAACAACC TCATT T T TCGT GT CAGAT TACACAGAGG T 
f" 4 - GGATAATAAT GTCACGAGGCAT TT GGATAAAGTATTAAAAAGAGGAGAT TGGGACATAT TAATCC 

Q TCCACTACCTGGGGCTGGACCACATTGGCCACATTTCAGGGCCCT^ACAGCCCCCTGATTGGGCAG 
p| AAGCTGAGCGAGATGGACAGCGTGC TGATGAAGATCCACACC TCACT GCAGTCGAAGGAGAGAGA 

m GACGCCTTTACCCAATTTGCTGGTTCTTTGTGGTGACCATGGCATGTCTGAAACAGGAAGTCACG 

I I GGGCCTCCTCCACCGAGGAGGTGAATACACCTCTGATTTTAATCAGTTCTGCGTTTGAAAGGAAA 
PJ CCCGGTGATATCCGACATCCAAAGCACGTCCA ATAGA CGGATGTGGCTGCGACACTGGCGATAGC 
If! ACT TGGCTTACCGAT TCCAAAAGACAGT GT AGGGAGCC TCC TAT T CCCAGT T GTGGAAGGAAGAC 
fu CAATGAGAGAGCAGTTGAGATT TT TACATT TGAATACAGTGCAGC T T AGTAAACTGT TGCAAGAG 

AATGTGCCGTCATATGAAAAAGATCCTGGGTTTGAGCAGTTTAAAATGTCAGAAAGATTGCATGG 
' kbi GAACTGGAT CAGAC TGTACT TGGAGGAAAAGCAT TCAGAAGT CCTAT T CAACC TGGGCTCCAAGG 

s TTCTCAGGCAGTACCTGGATGCTCTGAAGACGCTGAGCTTGTCCCTGAGTGCACAAGTGGCCCAG 
Q TTCTCACCCTGCTCCTGCTCAGCGTCCCACAGGCACTGCACAGAAAGGCTGAGCTGGAAGTCCCA 

III CTGTCATCTCCTGGGTTTTCTCTGCTCTTTTATTTGGTGATCCTGGTTCTTTCGGCCGTTCACGT 
f'vl CATTGTGTGCACCTCAGCTGAAAGTTCGTGCTACTTCTGTGGCCTCTCGTGGCTGGCGGCAGGCT 
Jj* GCCTTTCGTTTACCAGACTCTGGTTGAACACCTGGTGTGTGCCAAGTGCTGGCAGTGCCCTGGAC 

AGGGGGCCTCAGGGAAGGACGTGGAGCAGCCTTATCCCAGGCCTCTGGGTGTCCCGACACAGGTG 
|2 TTCACATCTGTGCTGTCAGGTCAGATGCCTCAGTTCTTGGAAAGCTAGGTTCCTGCGACTGTTAC 
ffl CAAGGT GAT T GTAAAGAGCTGGCGGTCACAGAGGAACAAGCCCCCCAGC T GAGGGGGTGTG TGAA 

TCGGACAGCCTCCCAGCAGAGGTGTGGGAGCTGCAGCTGAGGGAAGAAGAGACAATCGGCCTGGA 
CACTCAGGAGGGTCAAAAGGAGACTTGGTCGCACCACTCATCCTGCCACCCCCAGAATGCATCCT 
GCCTCATCAGGTCCAGATTTCTTTCCAAGGCGGACGTTTTCTGTTGGAATTCTTAGTCCTTGGCC 
TCGGACACCTTCATTCGTTAGCTGGGGAGTGGTGGTGAGGCAGTGAAGAAGAGGCGGATGGTCAC 
ACTCAGATCCACAGAGCCCAGGATCAAGGGACCCACTGCAGTGGCAGCAGGACTGTTGGGCCCCC 
ACCCCAACCCTGCACAGCCCTCATCCCCTCTTGGCTTGAGCCGTCAGAGGCCCTGTGCTGAGTGT 
C T GAC C GAGACAC T CACAGC T T T GT CAT C AG G GCAC AG GCT T C C T C GGAG C CAGGAT GATC TG T G 
CCACGCTTGCACCTCGGGCCCATCTGGGCTCATGCTCTCTCTCCTGCTATTGAATTAGTACCTAG 
CTGCACACAGTATGTAGTTACCAAAAGAATAAACGGCAATAATTGAGAAAAAAAA 
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FIGURE 14Q 

MRLGSGTFATCCVAIEVLGIAVFLRGFFPAPVRSSARAEHGAEPPAPEPSAGASSNWTTLPPPLF 
SKWIVLIDALRDDFVFGSKGVKFMPYTTYLVEKGASHSFVAEAKPPTVTMPRIKA1MTGSLPGF 
VDVIRNLNSPALLEDSVIRQAKAAGKRIVFYGDETWVKLFPKHFVEYDGTTSFFVSDYTEVDNNV 
TRHLDKVLKRGDWDILILHYLGLDHIGHISGPNSPLIGQKLSEMDSVLMKIHTSLQSKERETPLP 
NLLVLCGDHGMS E TGSHGAS S TEEVNT PL I L I SSAFERKPGDI RHPKHVQ 

Important features of the protein: 
^ Signal peptide: 

5 _ S 

amino acids 1-34 

III Transmembrane domain: 

If? 

amino acids 58-76 

N-glycosylation sites. 

amino acids 56-60, 194-198 

N-myristoylation sites. 

amino acids 6-12, 52-58, 100-106, 125-131, 233-239, 270-276, 
275-281, 278-284 

Amidation site. 

amino acids 154-158 

Cell attachment sequence. 

amino acids 205-208 
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FIGURE lai 



GGCACGAGGCAAGCCT TCCAGGTTATCGTGACGCACCT TGAAAGTCTGAGAGCTAC TGCCCTACA 
GAAAGTTACTAGTGCCCTAAAGCTGGCGCTGGCACTG ATG TTACTGCTGCTGTTGGAGTACAACT 
T CCCTAT AGAAAACAAC TGCCAGCACCT TAAGACCACTCACACCT T CAGAGT GAAGAAC T TAAAC 
CCGAAGAAAT TCAGCAT T CATGACCAGGAT CACAAAGTAC T GGTCC TGGACT CTGGGAATCTCAT 
AGCAGTT CCAGATAAAAAC TACATACGCCCAGAGATCT TCT T TGCATTAGCCT CAT CCT TGAGCT 
CAGCCTCTGCGGAGAAAGGAAGTCCGATTCTCCTGGGGGTCTCTAAAGGGGAGTTTTGTCTCTAC 
TGT GACAAGGATAAAGGACAAAGTCAT CCAT CCC T TCAGC TGAAGAAGGAGAAACT GAT GAAGCT 
GGCTGCCCAAAAGGAATCAGCACGCCGGCCCTTCATCTTTTATAGGGCTCAGGTGGGCTCCTGGA 
ACATGCTGGAGTCGGCGGCTCACCCCGGATGGTTCATCTGCACCTCCTGCAATTGTAATGAGCCT 
Ifl GTT GGGGTGACAGATAAAT T T GAGAACAGGAAACACAT TGAAT T T T CAT T TCAACCAGT T TGCAA 

III AGCTGAAATGAGCCCCAGTGAGGTCAGCGATTAGGAAACTGCCCCATTGAACGCCTTCCTCGCTA 
Wj AT TTGAACT AATT GTATAAAAACACCAAACCTGC T CAC T 
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FIGURE 142 

MLLLLLE YN FP I ENNCQHLKT THT FRVKNLNPKKFS I HDQDHKVLVLDS GNL IAVP DKNY IRPE I 
FFALASSLSSASAEKGS P ILLGVSKGE FCLYCDKDKGQSHPSLQLKKEKLMKLAAQKESARRPFI 
FYRAQVGSWNMLESAAHPGWFICTSCNCNEPVGVTDKFENRKHIEFSFQPVCKAEMSPSEVSD 

cAMP- and cGMP-dependent protein kinase phosphorylation site. 

amino acids 33-36 

N-myristoylation site. 

amino acids 50-55, 87-92 

InterleuJcin-1 

amino acids 37-182 
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FIGURE 143 



CTAGAGAGTATAGGGCAGAAGGATGGCAGAT GAGT GAC TCCACATCCAGAGCTGCC TCCCT T TAA 
TCCAGGATCCTGTCCTTCCTGTCCTGTAGGAGTGCCTGTTGCCAGTGTGGGGTGAGACAAGTTTG 
TCCCACAGGGCTGTCTGAGCAGATAAGATTAAGGGCTGGGTCTGTGCTCAATTAACTCCTGTGGG 
CACGGGGGCTGGGAAGAGCAAAGTCAGCGGTGCCTACAGTCAGCACCATGCTGGGCCTGCCGTGG 
AAGGGAGGTCTGTCCTGGGCGCTGCTGCTGCTTCTCTTAGGCTCCCAGATCCTGCTGATCTATGC 
CTGGCATTTCCACGAGCAAAGGGACTGTGATGAACACAATGTCATGGCTCGTTACCTCCCTGCCA 
CAGTGGAGTT TGC TGTCCACACAT T CAACCAACAGAGCAAGGACTACTAT GCC TACAGAC TGGGG 
CACATCTTGAATTCCTGGAAGGAGCAGGTGGAGTCCAAGACTGTATTCTCAATGGAGCTACTGCT 
p. GGGGAGAACTAGGT GT GGGAAAT T T GAAGACGACAT TGACAACT GCCAT T TCCAAGAAAGCACAG 

||! AGCTGAACAATACTTTCACCTGCTTCTTCACCATCAGCACCAGGCCCTGGATGACTCAGTTCAGC 
CTCCTGAACAAGACCTGCTTGGAGGGATTCCACTGAGTGAAACCCACTCACAGGCTTGTCCATGT 
GCTGCTCCCACATTCCGTGGACATCAGCACTACTCTCCTGAGGACTCTTCAGTGGCTGAGCAGCT 
TTGGACTTGTTTGTTATCCTATTTTGCATGTGTTTGAGATCTCAGATCAGTGTTTTAGAAAATCC 
s ACACAT C TTGAGCCTAATCATGTAGTGTAGAT CAT TAAACATCAGCAT T TTAAGAAAAAAAAAAA 

C3 AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 

If! 
13 



|.si 



w 

w 

ill 
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FIGURE iaa 

MLGLPWKGGLSWALLLLLLGSQ I LLI YAWHFHEQRDCDEHNVMARYLPATVE FAVHT FNQQSKDY 
YAYRLGHILNSWKEQVESKTVFSMELLLGRTRCGKFEDDIDNCHFQESTELNNTFTCFFTISTRP 
WMTQFSLLNKTCLEGFH 

Important features of the protein: 
Signal peptide: 

amino acids 1-25 

N-glycosylation sites. 

amino acids 117-121, 139-143 

N-myristoylation site. 

amino acids 9-15 
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FIGURE 1AZ 

CTGTGCAGC TCGAGGC TCCAGAGGCACACTCCAGAGAGAGCCAAGGT TCT GACGCGATGAGGAAG 
CACCTGAGCTGGTGGTGGCTGGCCACTGTCTGCATGCTGCTCTTCAGCCACCTCTCTGCGGTCCA 
GACGAGGGGCATCAAGCACAGAATCAAGTGGAACCGGAAGGCCCTGCCCAGCACTGCCCAGATCA 
CTGAGGCCCAGGTGGCTGAGAACCGCCCGGGAGCCTTCATCAAGCAAGGCCGCAAGCTCGACATT 
GACTTCGGAGCCGAGGGCAACAGGTACTACGAGGCCAACTACTGGCAGTTCCCCGATGGCATCCA 
CTACAACGGCTGCTCTGAGGCTAATGTGACCAAGGAGGCATTTGTCACCGGCTGCATCAATGCCA 
CCCAGGCGGCGAACCAGGGGGAGT T CCAGAAGCCAGACAACAAGC T CCACCAGCAGGTGC TC TGG 
CGGCTGGTCCAGGAGCTCTGCTCCCTCAAGCATTGCGAGTTTTGGTTGGAGAGGGGCGCAGGACT 
TCGGGTCACCATGCACCAGCCAGTGCTCCTCTGCCTTCTGGCTTTGATCTGGCTCATGGTGAAAT 
AAGCTTGCCAGGAGGCTGGCAGTACAGAGCGCAGCAGCGAGCAAATCCTGGCAAGTGACCCAGCT 
CTTCTCCCCCAAACCCACGCGTGTTCTGAAGGTGCCCAGGAGCGGCGATGCACTCGCACTGCAAA 
TGCCGCTCCCAGGTATGCGCCCTGGTATGTGCCTGCGTTCTGATAGATGGGGGACTGTGGCTTCT 
CCGTCACTCCATTCTCAGCCCCTAGCAGAGCGTCTGGCACACTAGATTAGTAGTAAATGCTTGAT 
GAGAAGAACACATCAGGCACTGCGCCACCTGCT TCACAGTAC T TCCCAACAAC TCT TAGAGGTAG 
GTGTAT T CCCGT T T TACAGATAAGGAAACT GAGGCCCAGAGAGCT GAAGTAC T GCACCCAGCAT C 
ACCAGCTAGAAAGTGGCAGAGCCAGGATTCAACCCTGGCTTGTCTAACCCCAGGTTTTCTGCTCT 
GTCCAAT TCCAGAGCTGTC T GGT GATCAC T T TATGTCTCACAGGGACCCACAT CCAAACATGTAT 
CTCTAATGAZ^ATTGTGAAAGCTCCATGTTTAGAAATAAATGAAAACACCTGA 
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FIGURE 146 

MRKHLSWWWLATVCMLL FSHLSAVQTRG IKHRIKWNRKALPSTAQ I TEAQVAENRPGAFI KQGRK 
LDIDFGAEGNRYYEANYWQFPDGIHYNGCSEANVTKEAFVTGCINATQAANQGEFQKPDNKLHQQ 
VLWRLVQELCSLKHCEFWLERGAGLRVTMHQPVLLCLLALIWLMVK 



Important features of the protein: 
Signal peptide: 

amino acids 1-26. 

Transmembrane domain: 

amino acids 157-171 

N-glycosylation sites. 

amino acids 98-102, 110-114 

PJ Tyrosine kinase phosphorylation site. 

Ill amino acids 76-83 

H 

*J? N-myristoylation sites. 

m amino acids 71-77, 88-94, 93-99, 107-113, 154-160 

Amidation site. 

amino acids 62-66 



m 
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FIGURE 147 

GCCTTGGCCTCCCAAAGGGCTGGGATTATAGGCGTGACCACCATGTCTGGTCCAGAGTCTCATTT 
CCTGATGATTTATAGACTCAAAGAAAACTCATGTTCAGAAGCTCTCTTCTCTTCTGGCCTCCTCT 
CTGTCTTCTTTCCCTCTTTCTTCTTATTTTAATTAGTAGCATCTACTCAGAGTCATGCAAGCTGG 
AAATCTTTCATTTTGCTTGTCAGTGGGGTAGGTCACTGAGTCTTAGTTTTTATTTTTTGAAATTT 
CAACTTTCAGATTCAGGGGGTACATGTGAAGGTTTGTTTTATGAGTATATTGCATGATGCTGAGG 
TTTGGGGT 



SO 



Ill 
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FIGURE 148 



MFRSSLLFWPPLCLLSLFLLILISSIYSESCKLEIFHFACQWGRSLSLSFYFLKFQLSDSGGTCE 
GLFYEYIA 

Important features of the protein: 
Signal peptide: 

amino acids 1-25 

N-myristoylation site. 

amino acids 62-68 
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FIGURE 14Q 

GTCTCCGC GT C ACAG GAACT TCAGCAC C CACAGGG C GGACAGC GC T CC CCT CT AC C T GGAGACT T GAC 
TCCCGCGCGCCCCAACCCTGCTTATCCCTTGACCGTCGAGTGTCAGAGATCCTGCAGCCGCCCAGTCC 
CGGCCCCTCTCCCGCCCCACACCCACCCTCCTGGCTCTTCCTGTTTTTACTCCTCCTTTTCATTCATA 
ACAAAAGCTACAGCTCCAGGAGCCCAGCGCCGGGCTGTGACCCAAGCCGAGCGTGGAAGAATGGGGTT 
CCTCGGGACCGGCACTTGGATTCTGGTGTTAGTGCTCCCGATTCAAGCTTTCCCCAAACCTGGAGGAA 
GCCAAGACAAAT CTCTACATAATAGAGAATTAAGTGCAGAAAGACCTT TGAAT GAACAGATTGCT GAA 
GCAGAAGAAGACAAGATTAAAAAAACATATCCTCCAGAAAACAAGCCAGGTCAGAGCAACTATTCTTT 
TGTTGATAACTTGAACCTGCTAAAGGCAATAACAGAAAAGGAAAAAATTGAGAAAGAAAGACAATCTA 
TAAGAAGCTCCCCAC TTGATAATAAGTT GAAT GTGGAAGAT GTTGATT CAACCAAGAATCGAAAACTG 
ATCGATGATTATGACTCTACTAAGAGTGGATT GGATCATAAATT T CAAGATGATCCAGATGGTCT TCA 
TGAACTAGACGGGACTCCT TTAACC GCTGAAGACATTGT CCATAAAAT CGCTGCCAGGATTTATGAAG 
AAA&T GACAGAGCCGTGTTTGACAAGAT TGTT T CTAAACTACTTAATCTCGGCCT TATCACAGAAAGC 
CAAGCACATACACTGGAAGATGAAGTAGCAGAGGTT TTACAAAAATTAATC TCAAAGGAAGCCAACAA 
TTATGAGGAGGATCCCAATAAGCCCACAAGCTGGACTGAGAATCAGGCTGGAAAAATACCAGAGAAAG 
TGACT CCAATGGCAGCAATTCAAGATGGT CT TGCTAAGGGAGAAAACGATGAAACAGTAT CTAACACA 
TTAACCT T GACAAATGGCTTGGAAAGGAGAACTAAAACCTACAGTGAAGACAACT TT GAGGAACT CCA 
ATATTTCCCAAATTTCTATGCGCTACTGAAAAGTATTGATTCAGAAAAAGAAGCAAAAGAGAAAGAAA 
CACTGATTACTATCATGAAAACACTGAT TGACT TTGTGAAGATGATGGTGAAATATGGAACAATATCT 
CCAGAAGAAGGTGTTTCCTACCTTGAAAACTTGGATGAAATGATTGCTCTTCAGACCAAAAACAAGCT 
AGAAAAAAATGCTACTGACAATATAAGCAAGCTTTTCCCAGCACCATCAGAGAAGAGTCATGAAGAAA 
CAGACAGTACCAAGGAAGAAGCAGCTAAGAT GGAAAAGGAATATGGAAGCT TGAAGGATT CCACAAAA 
GATGATAACTCCAACCCAGGAGGAAAGACAGATGAACCCAAAGGAAAAACAGAAGCCTATTTGGAAGC 
CATC^GAAAAAATATTGAATGGTTGAAGAAACATGACAAAAAGGGAAATAAAGAAGATTATGACCTTT 
CAAAGATGAGAGACTTCATCAATAAACAAGCTGATGCTTATGTGGAGAAAGGCATCCTTGACAAGGAA 
GAAGCCGAGGCCATCAAGCGCATTTATAGCAGCCTGTAAAAATGGCAAAAGATCCAGGAGTCTTTCAA 
CTGTTTCAGAAAACATAATATAGCTTAAAACACTTCTAATTCTGTGATTAAAATTTTTTGACCCAAGG 
GTTATTAGAAAGTGCTGAATTTACAGTAGTTAACCTTTTACAAGTGGTTAAAACATAGCTTTCTTCCC 
GTAAAAACTATCTGAAAGTAAAGTTGTATGTAAGCTGAAAAAAAAAAAAAAAAAAAA 
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FIGURE ISO 

MGFLGTGTWILVLVLPIQAFPKPGGSQDKSLHNRELSAERPLNEQIAEAEEDKIKKTYPPENKPG 
QSNYS FVDNLNLLKAI TEKEKI EKERQS IRS S PLDNKLNVEDVDS TKNRKLI DDYDS TKS GLDHK 
FQDDPDGLHQLDGTPLTAEDIVHKIAARIYEENDRAVFDKIVSKLLNLGLITESQAHTLEDEVAE 
VLQKLISKEANNYEEDPNKPTSWTENQAGKIPEKVTPMAAIQDGLAKGENDETVSNTLTLTNGLE 
RRTKT YSEDNFEELQYFPNFYALLKS I DSEKEAKEKE TL I T IMKTLI DFVKMMVKYGT I S PEEGV 
SYLENLDEMIALQTKNKLEKNATDNISKLFPAPSEKSHEETDSTKEEAAKMEKEYGSLKDSTKDD 
NSNPGGKTDEPKGKTEAYLEAIRKNIEWLKKHDKKGNKEDYDLSKMRDFINKQADAYVEKGILDK 
EEAEAIKRIYSSL 

N-glycosylation sites: 

amino acids 68-71, 346-349, 350-353 

Casein kinase II phosphorylation site: 

amino acids 70-73, 82-85, 97-100, 125-128, 147-150, 188-191, 217- 

220, 265-268, 289-292, 305-308, 320-323, 326-329, 362-365, 368- 
341, 369-372, 382-385, 386-389, 387-390 

N-myristoylation sites: 

amino acids 143-148, 239-244 
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FIGURE 151 



CGGCTCGAGGCTCCCGCCAGGAGAAAGGAACATTCTGAGGGGAGTCTACACCCTGTGGAGCTCAA 
GATGGTCCTGAGTGGGGCGCTGTGCTTCCGAATGAAGGACTCGGCATTGAAGGTGCTTTATCTGC 
ATAATAACCAGCT TC TAGC TGGAGGGCT GCAT GCAGGGAAGGTCAT TAAAGGTGAAGAGATCAGC 
GTGGTCCCCAATCGGTGGCTGGATGCCAGCCTGTCCCCCGTCATCCTGGGTGTCCAGGGTGGAAG 
CCAGTGCCTGTCATGTGGGGTGGGGCAGGAGCCGACTCTAACACTAGAGCCAGTGAACATCATGG 
AGCTCTATCTTGGTGCCAAGGAATCCAAGAGCTTCACCTTCTACCGGCGGGACATGGGGCTCACC 
TCCAGCTTCGAGTCGGCTGCCTACCCGGGCTGGTTCCTGTGCACGGTGCCTGAAGCCGATCAGCC 
TGTCAGACTCACCCAGCTTCCCGAGAATGGTGGCTGGAATGCCCCCATGACAGACTTCTACTTCC 
AGCAGTGTGACTAGGGCAACGTGCCCCCCAGAACTCCCTGGGCAGAGCCAGCTCGGGTGAGGGGT 
? GAGTGGAGGAGACCCATGGCGGACAAT CACT CTCT C TGC T CTCAGGACCCCCACGTC TGACT TAG 

jj£ TGGGCACCTGACCACTTTGTCTTCTGGTTCCCAGTTTGGATAAATTCTGAGATTTGGAGCTCAGT 
CI CCACGGTCCTCCCCCACTGGATGGTGCTACTGCTGTGGAACCTTGTAAAAACCATGTGGGGTAAA 
Q CTGGGAATAACATGAAAAGATTTCTGTGGGGGTGGGGTGGGGGAGTGGTGGGAATCATTCCTGCT 
III TAATGGTAACTGACAAGTGTTACCCTGAGCCCCGCAGGCCAACCCATCCCCAGTTGAGCCTTATA 
J*j GGGTCAGTAGCTCTCCACATGAAGTCCTGTCACTCACCACTGTGCAGGAGAGGGAGGTGGTCATA 
^ . GAGTCAGGGATCTATGGCCCTTGGCCCAGCCCCACCCCCTTCCCTTTAATCCTGCCACTGTCATA 

II! TGCTACCTTTCCTATCTCTTCCCTCATCATCTTGTTGTGGGCATGAGGAGGTGGTGATGTCAGAA 
fit GAAAT GGCT CGAGCTCAGAAGATAAAAGATAAGTAGGGTATGC TGAT CCT C TT T TAAAAACCCAA 

•|| GATACAATCAAAATCCCAGAT GCT GGTC T CTATTCCCATGAAAAAGT GC T CAT GACATAT TGAGA 

AGACCTACT TACAAAGTGGCATATATT GCAAT TTATTT TAAT TAAAAGATACC TAT T TATATATT 
* w TCTTTATAGAAAAAAGTC TGGAAGAGT T TACT TCAATT GTAGCAAT GTCAGGGTGGTGGCAGTAT 

O AGGTGAT TTTTCTTT TAAT TC TGT TAAT T TAT CTG TAT T TCCTAAT T T T T CT ACAATGAAGATGA 

|R AT TCCT TGTATAAAAATAAGAAAAGAAAT TAATCTTGAGGTAAGCAGAGCAGACATCATCTCTGA 

f| TTGTCCTCAGCCTCCACTTCCCCAGAGTAAATTCAAATTGAATCGAGCTCTGCTGCTCTGGTTGG 
TTGTAGTAGTGATCAGGAAACAGATCTCAGCAAAGCCACTGAGGAGGAGGCTGTGCTGAGTTTGT 
GTGGCTGGAATCTCTGGGTAAGGAACTTAAAGAACAAAAATCATCTGGTAATTCTTTCCTAGAAG 
GATCACAGCCCCTGGGATTCCAAGGCATTGGATCCAGTCTCTAAGAAGGCTGCTGTACTGGTTGA 
ft! ATTGTGTCCCCCTCAAATTCACATCCTTCTTGGAATCTCAGTCTGTGAGTTTATTTGGAGATAAG 
GTC TC TGCAGATGTAGT TAGT TAAGACAAGGT CAT GC TGGATGAAGGTAGACC TAAATTCAATAT 
GACTGGTTTCCTTGTATGAAAAGGAGAGGACACAGAGACAGAGGAGACGCGGGGAAGACTATGTA 
AAGAT GAAGGCAGAGATCGGAGT T TTGCAGC CACAAGC TAAGAAACACCAAGGATTGTGGCAACC 
ATCAGAAGCT TGGAAGAGGCAAAGAAGAATTC TTCCCTAGAGGCTTTAGAGGGATAACGGCTCTG 
C T GAAACCT TAATC TCAGAC TT CCAGCC TCCTGAACGAAGAAAGAATAAATT TCGGC TGT T TTAA 
GCCACCAAGGATAATT GGTTACAGCAGC TCTAGGAAAC TAATACAGCT GC TAAAATGATCCC TGT 
CTCCTCGTGTTTACATTCTGTGTGTGTCCCCTCCCACAATGTACCAAAGTTGTCTTTGTGACCAA 
TAGAATATGGCAGAAGT GATGGCAT GCCACT TCCAAGATTAGGT TATAAAAGACAC TGCAGC T TC 
TACTTGAGCCCTCTCTCTCTGCCACCCACCGCCCCCAATCTATCTTGGCTCACTCGCTCTGGGGG 
AAGCTAGCTGCCATGCTATGAGCAGGCCTATAAAGAGACTTACGTGGTAAAAAATGAAGTCTCCT 
GCCCACAGCCACATTAGTGAACCTAGAAGCAGAGAC T C T GT GAGATAAT CGAT GTT TGT TGT T T T 
AAGT T GCTCAGT T T TGGTCTAAC T TGT TAT GCAGCAATAGATAAATAATATGCAGAGAAAGAG 
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FIGURE 152 

MVLSGALCFRMKDSALKVLYLHNNQ 

QCLSCGVGQEPTLTLEPVNIMELYLGAKESKSFTFYRRDMGLTSSFESAAYPGWFLCTVPEADQP 
VRLTQLPENGGWNAPITDFYFQQCD 

N-myristoylation sites. 

amino acids 29-34, 30-35, 60-65, 63-68, 73-78, 91-96, 106-111 

Interleukin-1 signature. 

amino acids 111-131 

Interleukin-1 proteins* 

amino acids 8-29, 83-120, 95-134, 64-103 
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FIGURE lss 



If! 

'41 



CTTCAGAACAGGT TCTCCTTCCCCAGTCACCAGTTGC TCGAGT TAGAATTGTC TGCAATGGCCGC 
CCTGCAGAAATGTGTGAGCTCTTTCCTTATGGGGACCCTGGCCACCAGCTGCCTCCTTCTCTTGG 
CCCTCTTGGTACAGGGAGGAGCAGCTGCGCCCATCAGCTCCCACTGCAGGCTTGACAAGTCCAAC 
TTCCAGCAGCCCTATATCACCAACCGCACCTTCATGCTGGCTAAGGAGGCTAGCTTGGCTGATAA 
CAACACAGACGT TC GTC TCAT TGGGGAGAAAC TGT TCCACGGAGTCAGTAT GAGTGAGCGC TGCT 
AT CT GAT GAAGCAGGTGCTGAACT T CACCCT TGAAGAAGTGC T GTT CCC T CAATCTGATAGGTT C 
CAGCCTTATATGCAGGAGGTGGTGCCCTTCCTGGCCAGGCTCAGCAACAGGCTAAGCACATGTCA 
TAT TGAAGGT GATGACC TGCATAT CCAGAGGAAT GT GCAAAAGCT GAAGGACACAGTGAAAAAGC 
TTGGAGAGAGTGGAGAGATCAAAGCAAT T GGAGAAC TGGAT TT GCT GT T TAT GTCT C TGAGAAAT 
GCCTGCATT TGACCAGAGCAAAGCTGAAAAATGAATAAC TAAC CCCCTTTCCCTGC TAGAAATAA 
CAAT TAGAT GCCCCAAAGCGAT TT TTT T TAACCAAAAGGAAGATGGGAAGCCAAAC TCCATCAT G 
ATGGGTGGATTCCAAATGAACCCCTGCGTTAGTTACAAAGGAAACCAATGCCACTTTTGTTTATA 
AGACCAGAAGGTAGACTT TCTAAGGATAGATATT TATTGATAACATT TCATTGTAACTGGTGTTC 
TATACACAGAAAACAAT T TAT T TT T TAAATAAT TGTCT T T T TCCATAAAAAAGATTACT TTCCAT 
TC C T T T AGG GGAAAAAAC CC C T AAAT AG CT T CAT G T T T C C ATAAT CAGT AC T T TAT AT T TAT AAA 
TGTAT T TATTAT TAT TATAAGACTGCATT TTAT T TAT AT CAT T T TAT TAATAT GGAT TTAT T TAT 
AGAAACATCATT CGATAT TGC TACT TGAGT GTAAGGC TAATAT TGATAT T TAT GACAATAAT TAT 
AGAGC TATAACATGTT TATT TGACCTCAATAAACAC T TGGATATCCC 
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FIGURE is* 

MAALQKSVS S FLMGTLATSCLLLLALLVQGGAAAP I S SHCRLDKSNFQQP YI TNRT FMLAKEASL 
ADJSTNTDTOLIGEKLFHGVSMSERCYmKQVLNFTLEEVLFPQSDRFQPYMQEWPFLARLSNRLS 
TCHIEGDDLHIQRNVQKLKDTVKKLGESGEIKAIGELDLLFMSLRNACI 

Important features of the protein: 
Signal peptide: 

amino acids 1-33 

N-glycosylation sites. 

amino acids 54-58, 68-72, 97-101 

N-myristoylation sites. 

amino acids 14-20, 82-88 

Prokaryotic membrane lipoprotein lipid attachment site. 

amino acids 10-21 



APPJD=1 0063569 



Page 306 



FIGURE irs 

GGCTTGCTGAAAATAAAATCAGGACTCCTAACC TGCTCCAGTCAGCCTGCTTCCACGAGGCCTGT 
CAGTCAGTGCCCGACTTGTGACTGAGTGTGCAGTGCCCAGCATGTACCAGGTCAGTGCAGAGGGC 
TGCCTGAGGGCTGTGCTGAGAGGGAGAGGAGCAGAGATGCTGCTGAGGGTGGAGGGAGGCCAAGC 
TGCCAGGTTTGGGGCTGGGGGCCAAGTGGAGTGAGAAACTGGGATCCCAGGGGGAGGGTGCAGAT 
GAGGGAGCGACCCAGATTAGGTGAGGACAGTTCTCTCATTAGCCTTTTCCTACAGGTGGTTGCAT 
TCTTGGCAATGGTCATGGGAACCCACACCTACAGCCACTGGCCCAGCTGCTGCCCCAGCAAAGGG 
CAGGACACCTCTGAGGAGCTGCTGAGGTGGAGCACTGTGCCTGTGCCTCCCCTAGAGCCTGCTAG 
p% GCCCAACCGCCACCCAGAGTCCTGTAGGGCCAGTGAAGATGGACCCCTCAACAGCAGGGCCATCT 
Q CCCCCTGGAGATATGAGTTGGACAGAGACTTGAACCGGCTCCCCCAGGACCTGTACCACGCCCGT 
IP TGCCTGTGCCCGCACTGCGTCAGCCTACAGACAGGCTCCCACATGGACCCCCGGGGCAACTCGGA 
GCTGCTCTACCACAACCAGACTGTCTTCTACAGGCGGCCATGCCATGGCGAGAAGGGCACCCACA 
JjpJ AGGGCTACTGCCTGGAGCGCAGGCTGTACCGTGTTTCCTTAGCTTGTGTGTGTGTGCGGCCCCGT 
fr| GTGATGGGCTAGCCGGACCTGCTGGAGGCTGGTCCCTTTTTGGGAAACCTGGAGCCAGGTGTACA 
a AC CACT T G C CAT GAAGG GCC AGGAT GCC CAGAT GC TTGGCCCCTGT GAAG T G C T GT C T GGAGC AG 

13 CAGGATCCCGGGACAGGATGGGGGGCTTTGGGGAAAACCTGCACTTCTGCACATTTTGAAAAGAG 
^ 1 CAGCTGCTGCTTAGGGCCGCCGGAAGCTGGTGTCCTGTCATTTTCTCTCAGGAAAGGTTTTCAAA 
|p GTTCTGCCCATTTCTGGAGGCCACCACTCCTGTCTCTTCCTCTTTTCCCATCCCCTGCTACCCTG 
p% GCCCAGCACAGGCACTTTCTAGATATTTCCCCCTTGCTGGAGAAGAAAGAGCCCCTGGTTTTATT 
jffj TGTTTGTTTACTCATCACTCAGTGAGCATCTACTTTGGGTGCATTCTAGTGTAGTTACTAGTCTT 
TT GACAT GGAT GAT TC TGAGGAGGAAGCTGTTAT TGAATGTATAGAGAT T TATCCAAATAAATAT 
CT T TAT T T AAAAAT GAAAAA 
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FIGURE 156 



MRERPRLGEDSSLISLFLQWAFLAMVMGTHTYSHWPSCCPSKGQDTSEELLRWSTVPVPPLEPA 
RPNRHPESCRASEDGPLNSRAISPWRYELDRDLNRLPQDLYHARCLCPHCVSLQTGSHMDPRGNS 
ELLYHNQTVFYRRPCHGEKGTHKGYCLERRLYRVSLACVCVRPRVMG 

Important features of the protein: 
Signal peptide: 

amino acids 1-32 

N-glycosylation site. 

amino acids 136-140 

Tyrosine kinase phosphorylation site. 

amino acids 127-135 

N-myristoylation sites. 

amino acids 44-50, 150-156 
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FIGURE 157 



CCGGCG ATG TCGCTCGTGCTGCTAAGCCTGGCCGCGCTGTGCAGGAGCGCCGTACCCCGAGAGCC 
GACCGTTCAATGTGGCTCTGAAACTGGGCCATCTCCAGAGTGGATGCTACAACATGATCTAATCC 
CCGGAGACT T GAGGGACCTCCGAGTAGAACCTGT TACAACTAGTGT TGCAACAGGGGACTAT T CA 
ATTTTGATGAATGTAAGCTGGGTACTCCGGGCAGATGCCAGCATCCGCTTGTTGAAGGCCACCAA 
GATT T GT GT GACGGGCAAAAGCAAC T T CCAGTCC TACAGCTGTGT GAGGTGCAAT TACACAGAGG 
CCTTCCAGACTCAGACCAGACCCTCTGGTGGTAAATGGACATTTTCCTACATCGGCTTCCCTGTA 
GAGC TGAACACAGTCTAT TTCAT T GGGGCCCATAATAT T CC TAAT GCAAATAT GAAT GAAGATGG 
CCCT T CCATGTC TGTGAATT TCAC C T CACCAGGCT GCC TAGACCACATAAT GAAATATAAAAAAA 
AGTGTGTCAAGGCCGGAAGCCTGTGGGATCCGAACATCACTGCTTGTAAGAAGAATGAGGAGACA 
GTAGAAGT GAAC T TCACAACCACT CCCCTGGGAAACAGATACATGGCTCTTATCCAACACAGCAC 
TATCATCGGGTT TTCTCAGGTGTT TGAGCCACACCAGAAGAAACAAACGCGAGCT TCAGTGGTGA 
TTCCAGTGACTGGGGATAGTGAAGGTGCTACGGTGCAGCTGACTCCATATTTTCCTACTTGTGGC 
AGCGACTGCATCCGACATAAAGGAACAGTTGTGCTCTGCCCACAAACAGGCGTCCCTTTCCCTCT 
GGATAACAACAAAAGCAAGCCGGGAGGCTGGCTGCCTCTCCTCCTGCTGTCTCTGCTGGTGGCCA 
CATGGGTGCTGGTGGCAGGGATCTATCTAATGTGGAGGCACGAAAGGATCAAGAAGACTTCCTTT 
TCTACCACCACACTACTGCCCCCCATTAAGGTTCTTGTGGTTTACCCATCTGAAATATGTTTCCA 
TCACACAATT TGTTAC T TCAC TGAAT T T CT TCAAAACCATTGCAGAAGT GAGGTCATCCT TGAAA 
AG TGGCAGAAAAAGAAAATAGCAGAGAT GGGT CCAGTGCAGTGGC T TGCCACTCAAAAGAAGGCA 
GCAGACAAAGTCGTCTTCCTTCTTTCCAATGACGTCAACAGTGTGTGCGATGGTACCTGTGGCAA 
GAGCGAGGGCAGTCCCAGTGAGAACTCTCAAGACCTCTTCCCCCTTGCCTTTAACCTTTTCTGCA 
GTGATCTAAGAAGCCAGATTCATCTGCACAAATACGTGGTGGTCTACTTTAGAGAGATTGATACA 
AAAGACGATTACAATGCTCTCAGTGTCTGCCCCAAGTACCACCTCATGAAGGATGCCACTGCTTT 
CTGTGCAGAACTTCTCCATGTCAAGCAGCAGGTGTCAGCAGGAAAAAGATCACAAGCCTGCCACG 
ATGGCTGCTGCTCCTTGTAG 
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FIGURE is8 

MSLVLLSLAMjCRSAVPREPTVQCGSETGPSPEWMLQHDLIPGDLRDLRVEPVTTSVATGDYSILMIWSW 
LRADASIRLLKATKICVTGKSNFQSYSCVRCNYTEAFQTQTRPSGGKWTFSYIGFPVELNTVYFIGAHNIP 
NANMNEDGPSMSWFTSPGCLDHIMKYKKKCVKAGSLWDPNITACKKNEETVEVNFTTTPLGNRYMALIQH 
STIIGFSQVFEPHQKKQTRASWIPVTGDSEGATVQLTPYFPTCGSDCIRHKGTWLCPQTGVPFPLDNNK 
SKPGGWLPLLLLSLLVATWVLVAGIYLMWRHERIKKTSFSTTTLLPPIECVLVVYPSEICFHHTICYFTEFL 
QNHCRSEVILEKWQKKKIAEMGPVQWIATQKKAADKWFLLSNDVNSVCDGTCGKSEGSPSENSQDLFPLA 
FNL FC S DL RS Q I HLHKY VW Y FRE I DT KDDYNAL SVC P KY HLMKDAT AFC AE LL H VKQQVS AGKRS QAC HD 
GCCSL 

if" 5 ;. 

ipj Important features of the protein: 

|f1 Signal peptide: 

%U amino acids 1-14 

!p ■ 

if f 5 

¥ * Transmembrane domain : 

™* amino acids 290-309 



sj«. N-glycosylation sites. 

P| amino acids 67 - 71, 103 - 107, 156 - 160, 183 - 187, 197 - 201 and 283 

fll - 287 

Ill cAMP- and cGMP-dependent protein kinase phosphorylation sites. 

amino acids 228 - 232 and 319 - 323 



Casein kinase II phosphorylation sites. 

amino acids 178 - 182, 402 - 406, 414 - 418 and 453 - 457 



N-myristoylation site. 

amino acids 116-122 

Amidation site. 

amino acids 488-452 
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FIGURE 15Q 

AG CCACCAGC GCAft.CATGA.CAG T GAAGAC CC T GCAT GGCCCAG C CAT GG T CAAGTAC T TGC T GC T 
GTCGATATTGGGGCTTGCCTTTCTGAGTGAGGCGGCAGCTCGGAAAATCCCCAAAGTAGGACATA 
CTTTTTTCCAAAAGCCTGAGAGTTGCCCGCCTGTGCCAGGAGGTAGTATGAAGCTTGACATTGGC 
ATCATCAATGAAAACCAGCGCGTTTCCATGTCACGTAACATCGAGAGCCGCTCCACCTCCCCCTG 
GAATTACACTGTCACTTGGGACCCCAACCGGTACCCCTCGGAAGTTGTACAGGCCCAGTGTAGGA 
ACT T GGGCTGCATCAATGCTCAAGGAAAGGAAGACATC T CCAT GAAT TCCGTTCCCATCCAGCAA 
GAGACCCTGGTCGTCCGGAGGAAGCACCAAGGCTGCTCTGTTTCTTTCCAGTTGGAGAAGGTGCT 
GGTGACTGTTGGCTGCACCTGCGTCACCCCTGTCATCCACCATGTGCAGTAAGAGGTGCATATCC 
AC TCAGCTGAAGAAG 
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FIGURE 160 

MTVKTLHGPAMVKYLLLSILGLAFLSEAAARKIPKVGHTFFQKPESCPPVPGGSMKLDIGIINEN 
QRVSMSRNIESRSTSPWNYTVTWDPNRYPSEWQAQCRNLGCINAQGKEDISMNSVPIQQETLW 
RRKHQGCSVSFQLEKVLVTVGCTCVTPVIHHVQ 



•••:r 



Signal sequence: 

amino acids 1-30 

N-glycosylation site. 

amino acids 83-87 

N-rayristoylation sites. 

amino acids 106-111, 136-141 
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FIGURE 161 



ACAC T GGCCAAACAAAAACGAAAGCAC T CCGTGC T GGAAG TAGGAGGAGAGT CAGGAC TCCCAGG 
ACAGAGAGTGCACAAACTACCCAGCACAGCCCCCTCCGCCCCCTCTGGAGGCTGAAGAGGGATTC 
CAGCCCCTGCCACCCACAGACACGGGCTGACTGGGGTGTCTGCCCCCCTTGGGGGGGGGCAGCAC 
AGGGCCTCAGGCCTGGGTGCCACCTGGCACCTAGAAGATGCCTGTGCCCTGGTTCTTGCTGTCCT 
TGGCACTGGGCCGAAGCCCAGTGGTCCTTTCTCTGGAGAGGCTTGTGGGGCCTCAGGACGCTACC 
CACTGCTCTCCGGGCCTCTCCTGCCGCCTCTGGGACAGTGACATACTCTGCCTGCCTGGGGACAT 
CGTGCCTGCTCCGGGCCCCGTGCTGGCGCCTACGCACCTGCAGACAGAGCTGGTGCTGAGGTGCC 
AGAAGGAGACCGACTGTGACCTCTGTCTGCGTGTGGCTGTCCACTTGGCCGTGCATGGGCACTGG 
GAAGAGCCT GAAGATGAGGAAAAGT T T GGAGGAGCAGC TGAC T CAGGGGTGGAGGAGCCTAGGAA 
|^ TGCCTCTCTCCAGGCCCAAGTCGTGCTCTCCTTCCAGGCCTACCCTACTGCCCGCTGCGTCCTGC 
J« % TGGAGGTGCAAGTGCCTGCTGCCCTTGTGCAGTTTGGTCAGTCTGTGGGCTCTGTGGTATATGAC 
TGCTTCGAGGCTGCCCTAGGGAGTGAGGTACGAATCTGGTCCTATACTCAGCCCAGGTACGAGAA 
1*1 GGAACTCAACCACACACAGCAGCTGCCTGCCCTGCCCTGGCTCAACGTGTCAGCAGATGGTGACA 
|f! ACGTGCATCTGGTTCTGAATGTCTCTGAGGAGCAGCACTTCGGCCTCTCCCTGTACTGGAATCAG 
y GTCCAGGGCCCCCCAAAACCCCGGTGGCACAAAAACCTGACTGGACCGCAGATCATTACCTTGAA 
CCACACAGACCTGGTTCCCTGCCTCTGTATTCAGGTGTGGCCTCTGGAACCTGACTCCGTTAGGA 
^ CGAACATCTGCCCCTTCAGGGAGGACCCCCGCGCACACCAGAACCTCTGGCAAGCCGCCCGACTG 
CP CGACTGCTGACCCTGCAGAGCTGGCTGCTGGACGCACCGTGCTCGCTGCCCGCAGAAGCGGCACT 
h!| GTGCTGGCGGGCTCCGGGTGGGGACCCCTGCCAGCCACTGGTCCCACCGCTTTCCTGGGAGAACG 
s TCACTGTGGACAAGGTTCTCGAGTTCCCATTGCTGAAAGGCCACCCTAACCTCTGTGTTCAGGTG 
PI AACAGCTCGGAGAAGCrGCAGCTGCAGGAGTGCTTGTGGGCTGACTCCCTGGGGCCTCTCAAAGA 
CGATGTGCTACTGTTGGAGACACGAGGCCCCCAGGACAACAGATCCCTCTGTGCCTTGGAACCCA 
IN GTGGCTGTACTTCACTACCCAGCAAAGCCTCCACGAGGGCAGCTCGCCTTGGAGAGTACTTACTA 
O CAAGACCTGCAGTCAGGCCAGTGTCTGCAGCTATGGGACGATGACTTGGGAGCGCTATGGGCCTG 
p| CCCCATGGACAAATACATCCACAAGCGCTGGGCCCTCGTGTGGCTGGCCTGCCTACTCTTTGCCG 
pf CTGCGCTTTCCCTCATCCTCCTTCTCAAAAAGGATCACGCGAAAGGGTGGCTGAGGCTCTTGAAA 
jj* CAGGACGTCCGCTCGGGGGCGGCCGCCAGGGGCCGCGCGGCTCTGCTCCTCTACTCAGCCGATGA 
III CTCGGGTTTCGAGCGCCTGGTGGGCGCCCTGGCGTCGGCCCTGTGCCAGCTGCCGCTGCGCGTGG 
CCGTAGACCTGTGGAGCCGTCGTGAACTGAGCGCGCAGGGGCCCGTGGCTTGGTTTCACGCGCAG 
CGGCGCCAGACCCTGCAGGAGGGCGGCGTGGTGGTCTTGCTCTTCTCTCCCGGTGCGGTGGCGCT 
GTGCAGCGAGTGGCTACAGGATGGGGTGTCCGGGCCCGGGGCGCACGGCCCGCACGACGCCTTCC 
GCGCCTCGCTCAGCTGCGTGCTGCCCGACTTCTTGCAGGGCCGGGCGCCCGGCAGCTACGTGGGG 
GCCTGCTTCGACAGGCTGCTCCACCCGGACGCCGTACCCGCCCTTTTCCGCACCGTGCCCGTCTT 
CACACTGCCCTCCCAACTGCCAGACTTCCTGGGGGCCCTGCAGCAGCCTCGCGCCCCGCGTTCCG 
GGCGGCTCCAAGAGAGAGCGGAGCAAGTGTCCCGGGCCCTTCAGCCAGCCCTGGATAGCTACTTC 
CATCCCCCGGGGACTCCCGCGCCGGGACGCGGGGTGGGACCAGGGGCGGGACCTGGGGCGGGGGA 
CGGGACT TAAATAAAGGCAGACGCT GT T T T T C TAfiAAAAA 
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FIGURE 162 



MPVPWFLLSLALGRSPWLSLERLVGPQDATHCSPGLSCRLWDSDILCLPGDIVPAPGPVIiAPTHLQTELV 
LRCQKETDCDLCLRVAVHLAVHGHWEEPEDEEKFGGAADSGVEEPRNASLQAQWLSFQAYPTARCVLLEV 
QVPAALVQFGQSVGSVWDCFEAALGSEVRIWSYTQPRYEKEIJ^HTQQLPALPWLNVSADGDNVHLVLNVS 
EEQHFGLSLYWNQVQGPPKPRWHKNLTGPQIITLNHTDLVPCLCIQVWPLEPDSVRTNICPFREDPRAHQN 
LWQAARLRLLTLQSWLLDAPCSLPAEAALCWRAPGGDPCQPLVPPLSWENVTVDKVLE FPLLKGHPNLCVQ 
VNSSEKLQLQECLWADSLGPLKDDVLLLETRGPQDNRSLCALEPSGCTSLPSKASTRAARLGEYLLQDLQS 
GQCLQLWDDDLGALWACPMDKYIHKRWALWIACLLFA 
SU- RAALLL Y S ADDSG FE RLVGALASALCQL PLRVAVDL WS RRELS AQG P VAW FH AQRRQTLQE G GVWLL FS P 

O GAVALCSEWLQDGVSGPGAHGPHDAFRASLSCVLPDFLQGRAPGSYVGACFDRLLHPDAVPALFRTVPVFT 
W LPSQLPDFLGALQQPRAPRSGRLQERAEQVSRALQPALDSYFHPPGTPAPGRGVGPGAGPGAGDGT 

m 

» 5 5 

TZ Signal sequence: 

^! amino acids 1-20 

Transmembrane domain. 

r : t amino acids 453-475 

|3 N-glycosylation sites. 



ru 



amino acids 118-121, 186-189, 198-201, 211-214, 238-241, 248-251, 
334-337, 357-360, 391-394 

Glycosaminoglycan attachment site. 

amino acids 583-586 

cAMP- and cGMP-dependent protein kinase phosphorylation site. 

amino acids 552-555 



N-myristoylation sites. 

amino acids 107-112, 152-157, 319-324, 438-443, 516-521, 612-617, 
692-697, 696-701, 700-705 
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FIGURE 16ft 



13 



i y 



GGGAGGGCTCTGTGCCAGCCCCGATGAGGACGCTGCTGACCATCTTGACTGTGGGATCCCTGGCT 
GCTCACGCCCCTGAGGACCCCTCGGATCTGCTCCAGCACGTGAAATTCCAGTCCAGCAACTTTGA 
AAACATCC T GACGTGGGACAGCGGGCCAGAGGGCACCCCAGACACGGT CTACAGCATCGAGTATA 
AGACGTACGGAGAGAGGGACTGGGTGGCAAAGAAGGGCTGTCAGCGGATCACCCGGAAGTCCTGC 
AACCTGACGGTGGAGACGGGCAACCTCACGGAGCTCTACTATGCCAGGGTCACCGCT 
GT CAGTGCGGGAGGCCGGTCAGCCACCAAGATGACTGACAGGT T CAGCT CT G TGCAGCACAC TAC 
CCTCAAGCCACCTGATGTGACCTGTATCTCCAAAGTGAGATCGATTCAGATGATTGTTCATCCTA 
CCCCCACGCCAATCCGTGCAGGCGATGGCCACCGGCTAACCCTGGAAGACATCTTCCATGACCTG 
T TCTACCACT TAGAGCTCCAGGT CAAC CGCACCTACCAAATGCACCTT GGAGGGAAGCAGAGAGA 
ATATGAGTTCTTCGGCCTGACCCCTGACACAGAGTTCCTTGGCACCATCATGATTTGCGTTCCCA 
CCTGGGCCAAGGAGAGTGCCCCCTACATGTGCCGAGTGAAGACACTGCCAGACCGGACATGGACC 



*** TACTCCTTCTCCGGAGCCTTCCTGTTCTCCATGGGCTTCCTCGTCGCAGTACTCTGCTACCTGAG 
13 CTACAGATATGTCACCAAGCCGCCTGCACCTCCCAACTCCCTGAACGTCCAGCGAGTCCTGACTT 
fp TCCAGCCGCTGCGCTTCATCCAGGAGCACGTCCTGATCCCTGTCTTTGACCTCAGCGGCCCCAGC 
§«§ AGTCTGGCCCAGCCTGTCCAGTACTCCCAGATCAGGGTGTCTGGACCCAGGGAGCCCGCAGGAGC 
?JJ TCCACAGCGGCATAGCC TGTCCGAGAT CAC CTAC T TAGGGCAGCCAGACATCTCCATCC TCCAGC 

CCTCCAACGTGCCACCTCCCCAGATCCTCTCCCCACTGTCCTATGCCCCAAACGCTGCCCCTGAG 
IP GTCGGGCCCCCATCCTATGCACCTCAGGTGACCCCCGAAGCTCAATTCCCATTCTACGCCCCACA 
GGCCATCTCTAAGGTCCAGCCTTCCTCCTATGCCCCTCAAGCCACTCCGGACAGCTGGCCTCCCT 
J" CCTATGGGGTATGCATGGAAGGTTCTGGCAAAGACTCCCCCACTGGGACACTTTCTAGTCCTAAA 
CACCTTAGGCCTAAAGGTCAGCTTCAGAAAGAGGCACCAGCTGGAAGCTGCATGTTAGGTGGCCT 
TTCTCTGCAGGAGGTGACCTCCTTGGCTATGGAGGAATCCCAAGAAGCAAAATCATTGCACCAGC 
111 CCCT GGGGAT T TGCACAGACAGAACAT C TGAC CCAAAT GTGC TACACAGT GGGGAGGAAGGGACA 

Q CCACAGTACCTAAAGGGCCAGCTCCCCCTCCTCTCCTCAGTCCAGATCGAGGGCCACCCCATGTC 
CCTCCCTTTGCAACCTCCTTCCGGTCCATGTTCCCCCTCGGACCAAGGTCCAAGTCCCTGGGGCC 
TGCTGGAGTCCCTTGTGTGTCCCAAGGATGAAGCCAAGAGCCCAGCCCCTGAGACCTCAGACCTG 
J»( GAGCAGCCCACAGAAC TGGAT TC TCT T T TCAGAGGCC T GGCCCT GACT GT GCAGT GGGAGTCC TG 

III AGGGGAATGGGAAAGGCTTGGTGCTTCCTCCCTGTCCCTACCCAGTGTCACATCCTTGGCTGTCA 
ATCCCATGCCTGCCCATGCCACACACTCTGCGATCTGGCCTCAGACGGGTGCCCTTGAGAGAAGC 
AGAGGGAGT GGCATGCAGGGCCCCTGCCAT GGGT GCGC TCC T CACC GGAACAAAGCAGCATGATA 
AGGACTGCAGCGGGGGAGCTCTGGGGAGCAGCTTGTGTAGACAAGCGCGTGCTCGCTGAGCCCTG 
CAAGGCAGAAATGACAGTGCAAGGAGGAAATGCAGGGAAACTCCCGAGGTCCAGAGCCCCACCTC 
CTAACACCATGGATTCAAAGTGCTCAGGGAATTTGCCTCTCCTTGCCCCATTCCTGGCCAGTTTC 
ACAATCTAGCTCGACAGAGCATGAGGCCCCTGCCTCTTCTGTCATTGTTCAAAGGTGGGAAGAGA 
GCCT GGAAAAGAACCAGGCC T GGAAAAGAACCAGAAGGAGGC TGGGCAGAACCAGAACAACCT GC 
ACTTCTGCCAAGGCCAGGGCCAGCAGGACGGCAGGACTCTAGGGAGGGGTGTGGCCTGCAGCTCA 
T T CCCAGCCAGGGCAAC TGCC TGACGT TGCACGATT T CAGCT TCAT TCCTCT GATAGAACAAAGC 
GAAATGCAGGTCCACCAGGGAGGGAGACACACAAGCCT T T T C T GCAGGCAGGAGT TTCAGACCCT 
ATCCTGAGAATGGGGTTTGAAAGGAAGGTGAGGGCTGTGGCCCCTGGACGGGTACAATAACACAC 
TGTACTGATGTCACAACTTTGCAAGCTCTGCCTTGGGTTCAGCCCATCTGGGCTCAAATTCCAGC 
CTCACCACTCACAAGCTGTGTGACTTCAAACAAATGAAATCAGTGCCCAGAACCTCGGTTTCCTC 
AT CTGTAATGTGGGGAT CATAACACCTACC T CATGGAGT TGTGGT GAAGAT GAAAT GAAGTCAT G 
TCTTTAAAGTGCTTAATAGTGCCTGGTACATGGGCAGTGCCCAATAAACGGTAGCTATTTAAAiyV 
AAAAAAAA 
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FIGURE i6a 

MRTLLTILTVGSLAAHAPEDPSDLLQHVKFQSSNFENILTWDSGPEGTPDTVYSIEYKTYGERDW 
VAKKGCQRITRKSCNLTVETGNLTELYYARVTAVSAGGRSATKMTDRFSSLQHTTLKPPDVTCIS 
KVRSIQMIVHPTPTPIRAGDGHRLTLEDIFHDLFYHLELQVNRTYQMHLGGKQREYEFFGLTPDT 
E FLGT IMI CVP TWAKE SAPYMCRVKTL PDRTWT YS FSGAFLFSMGFLVAVLCYLSYRYVTKPPAP 
PNSLNVQRVLTFQPLRFIQEHVLIPVFDLSGPSSLAQPVQYSQIRVSGPREPAGAPQRHSLSEIT 
YLGQPDISILQPSNVPPPQILSPLSYAPNAAPEVGPPSYAPQVTPEAQFPFYAPQAISKVQPSSY 
APQATPDSWPPSYGVCMEGSGKDSPTGTLSSPKHLRPKGQLQKEPPAGSCMLGGLSLQEVTSLAM 
EESQEAKSLHQPLGICTDRTSDPNVLHSGEEGTPQYLKGQLPLLSSVQIEGHPMSLPLQPPSGPC 
SPSDQGPSPWGLLESLVCPKDEAKSPAPETSDLEQPTELDSLFRGLALTVQWES 

Signal sequence. 

amino acids 1-17 

Transmembrane domain. 

amino acids 233-250 

N-glycosylation sites. 

amino acids 80-83, 87-90, 172-175 

N-myristoylation sites. 

amino acids 11-16, 47-52, 102-107, 531-536, 565-570 
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FIGURE 165 



TnGrr.TAGTGGAAAAAAAAAAAAAAAAAAAAAAAGTCACCCGGGCCCGCGGTGGCCACAACATGG 
CTGCGGCGCCGGGGCTGCTCTTCTGGCTGTTCGTGCTGGGGGCGCTCTGGTGGGTCCCGGGCCAG 
TCGGATCTCAGCCACGGACGGCGTTTCTCGGACCTCAAAGTGTGCGGGGACGAAGAGTGCAGCAT 
GTTAATGTACCGTGGGAAAGCTCTTGAAGACTTCACGGGCCCTGATTGTCGTTTTGTGAATTTTA 
AAAAAGGT GACGATGTATAT GT C T ACTACAAAC T GGCAGGGGGAT CCC T TGAACT T TGGGCTGGA 
AGTGT T GAACACAGT T T TGGATAT TT T CCAAAAGATT TGATCAAGGTACTT CATAAATACACGGA 
AGAAGAGC TACATAT TCCAGCAGATGAGACAGACT TTGTC TGCT T T GAAGGAGGAAGAGATGATT 
TTAAT AGT TATAATGTAGAAGAGCTT T TAGGATC T TT GGAACT GGAGGACTC TGTACC TGAAGAG 
TCGAAGAAAGCTGAAGAAGT T TCT CAGCACAGAGAGAAAT CT CCTGAGGAGT C TCGGGGGCGTGA 
AC TT GACCC T GTGCC TGAGCCCGAGGCATT CAGAGCTGAT TCAGAGGAT GGAGAAGGTGC T T TCT 

CAGAGAGCACCGAGGGGCTGCAGGGACAGCCCTCAGCTCAGGAGAGCCACCCTCACACCAGCGGT 

» p 

m CCTGCGGCTAACGCTCAGGGAGTGCAGTCTTCGTTGGACACTTTTGAAGAAATTCTGCACGATAA 

%n 

■J% ATTGAAAGTGCCGGGAAGCGAAAGCAGAACTGGCAATAGTTCTCCTGCCTCGGTGGAGCGGGAGA 

9 AGACAGATGCTTACAAAGTCCTGAAAACAGAAATGAGTCAGAGAGGAAGTGGACAGTGCGTTATT 

13 CAT T ACAGCAAAGGAT T TCGT TGGCAT CAAAATCTAAGT TTGT T T TACAAAGAT T GT TT T TAGT A 

in 



CTAAGCTGCCTTGGCAGTTTGCATTTTTGAGCCAAACAAAAATATATTATTTTCCCTTCTAAGTA 
AAAAAAAAAAAAAAAAAAAA 
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FIGURE 166 



MAAAPGLLFWLFVLGALWWPGQSDLSHGRRFSDLKVCGDEECSMMYRGKALEDFTGPDCRFVN 
FKKGDDVYVYYKLAGGSLELWAGSVEHSFGYFPKDLIKVLHKYTEEELHIPADETDFVCFEGGRD 
DFNSYNVEELLGSLELEDSVPEESKKAEEVSQHREKSPEESRGRELDPVPEPKA.FRADSEDGEGA 
FSESTEGLQGQPSAQESHPHTSGPAANAQGVQSSLDTFEEILHDKLKVPGSESRTGNSSPASVER 
EKTDAYKVLKTEMSQRGSGQCVIHYSKGFRWHQNLSLFYKDCF 



Important features of the protein: 
f«% Signal peptide: 

£| amino acids 1-22 



N-glycosylation site, 

amino acids 294-298 

cAMP- and cGMP-dependent protein kinase phosphorylation site. 

amino acids 30-34 



fll 

f r ! amino acids 67-76 

fll 



Tyrosine kinase phosphorylation site. 



N-myristoylation sites. 

amino acids 205-211, 225-231, 277-283 

Amidation site. 

amino acids 28-32 
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FIGURE 167 



CCAGGACCAGGGCGCACCGGCTCAGCCTCTCACTTGTCAGAGGCCGGGGAAGAGAAGCAAAGCGC 
AACGGTGTGGTCCAAGCCGGGGCTTCTGCTTCGCCTCTAGGACATACACGGGACCCCCTAACTTC 
AGTCCCCCAAACGCGCACCCTCGAAGTCTTGAACTCCAGCCCCGCACATCCACGCGCGGCACAGG 
CGCGGCAGGCGGCAGGTCCCGGCCGAAGGCGATGCGCGCAGGGGGTCGGGCAGCTGGGCTCGGGC 
GGCGGGAGTAGGGCCCGGCAGGGAGGCAGGGAGGCTGCATATTCAGAGTCGCGGGCTGCGCCCTG 
GGCAGAGGCCGCCCTCGCTCCACGCAACACCTGCTGCTGCCACCGCGCCGCGATGAGCCGCGTGG 
TCTCGCTGCTGCTGGGCGCCGCGCTGCTCTGCGGCCACGGAGCCTTCTGCCGCCGCGTGGTCAGC 
GGCCAAAAGGTGTGTTTTGCTGACTTCAAGCATCCCTGCTACAAAATGGCCTACTTCCATGAACT 
GTCCAGCCGAGTGAGCTTTCAGGAGGCACGCCTGGCTTGTGAGAGTGAGGGAGGAGTCCTCCTCA 
GCCT T GAGAATGAAGCAGAACAGAAGTTAATAGAGAGCATGTTGCAAAACC T GACAAAACCCGGG 
{wt ACAGGGAT T T CT GATGGTGATT T CTGGATAGGGCT TTGGAGGAATGGAGAT GGGCAAACATCT GG 

O TGCCTGCCCAGATCTCTACCAGTGGTCTGATGGAAGCAATTCCCAGTACCGAAACTGGTACACAG 
pi ATGAACCTTCCTGCGGAAGTGAAAAGTGTGTTGTGATGTATCACCAACCAACTGCCAATCCTGGC 
J;;;f CTTGGGGGTCCCTACCTTTACCAGTGGAATGATGACAGGTGTAACATGAAGCACAATTATATTTG 
CAAGTATGAACCAGAGAT TAAT CCAACAGCCCCTGTAGAAAAGCCT TAT CT TACAAATCAACCAG 
||j GAGACACCCATCAGAATGTGGTTGTTACTGAAGCAGGTATAATTCCCAATCTAATTTATGTTGTT 
|f! ATACCAACAATACCCCTGCTCTTACTGATACTGGTTGCTTTTGGAACCTGTTGTTTCCAGATGCT 
$m GCAT AAAAGTAAAGGAAGAACAAAAACT AGT CCAAACCAGTC TACACT GTGGAT TTCAAAGAGTA 

% CCAGAAAAGAAAGTGGCATGGAAGTATAATAACTCATTGACTTGGTTCCAGAATTTTGTAATTCT 
W GGAT CT GT AT AAGGAATGGC AT CAGAACAAT AGC T TGGAATGGCT T GAAAT CACAAAGGATC T GC 

C AAG AT G AAC T GT AAGC TCCCCCTT GAGG C AAAT AT TAAAG TAAT T T T TAT AT G T C TAT TAT T T C A 
f| TT TAAAGAATAT GCTGTGC TAAT AAT GGAGT GAGACATGCT TAT T T TGCT AAAGGATGCACCCAA 

m ACTTCAAACTTCAAGCAAATGAAATGGACAATGCAGATAAAGTTGTTATCAACACGTCGGGAGTA 
jj| TGTGTGTTAGAAGCAATTCCTTTTATTTCTTTCACCTTTCATAAGTTGTTATCTAGTCAATGTAA 
Q TGTATATTGTATTGAAATTTACAGTGTGCAAAAGTATTTTACCTTTGCATAAGTGTTTGATAAAA 
fll AT GAACT GT T CTAATAT TTATT TT TATGGCAT CTCAT T T TT CAATACATGCT C T T T TGAT TAAAG 

p AAACT TATTACTGTTGTCAACTGAAT TCACACACACACAAATATAGTACCATAGAAAAAGTT TGT 

f|| TTTCTCGAAATAATTCATCTTTCAGCTTCTCTGCTTTTGGTCAATGTCTAGGAAATCTCTTCAGA 
- AATAAGAAGCTAT TTCATTAAGTGTGATATAAACCTCCTCAAACAT TT TACTTAGAGGCAAGGAT 

TGTCTAATTTCAATTGTGCAAGACATGTGCCTTATAATTATTTTTAGCTTAAAATTAAACAGATT 
T T GTAATAAT GTAACT T TGT TAAT AGGT GCAT AAACAC TAATGCAG TCAAT T TGAACAAAAGAAG 
TGACATACACAATATAAATCATATGTCTTCACACGTTGCCTATATAATGAGAAGCAGCTCTCTGA 
GGGTTCTGAAATCAATGTGGTCCCTCTCTTGCCCACTAAACAAAGATGGTTGTTCGGGGTTTGGG 
AT TGACACT GGAGGCAGATAGT T GCAAAGT T AGTC TAAGGT T TCCCT AGCT GTAT T TAGCCTC T G 
AC TAT AT TAGTATACAAAGAGGT CAT GT GGT T GAGACCAGGTGAATAG TCAC TATCAGTGTGGAG 
ACAAGCACAGCACACAGACATT TTAGGAAGGAAAGGAAC TAC GAAAT CGTGT GAAAATGGGT TGG 
AACCCATCAGTGATCGCATATT CATT GATGAGGGT TTGC T TGAGATAGAAAAT GGT GGC TCCT T T 
CTGTCTTATCTCCTAGTTTCTTCAATGCTTACGCCTTGTTCTTCTCAAGAGAAAGTTGTAACTCT 
CTGGTCTTCATATGTCCCTGTGCTCCTTTTAACCAAATAAAGAGTTCTTGTTTCTGGGGGAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 168 



in 
cri 



fit 



MSRWSLLLGAALLCGHGAFCRRWSGQKVCFADFKHPCYKMAYFHELS SRVS FQEARLACESE 
GGVLLSLENEAEQKLIESMLQNLTKPGTGISDGDFWIGLWRNGDGQTSGACPDLYQWSDGSNSQ 
YRNWYTDEPSCGSEKCVVMYHQPTT^PGLGGPYLYQWNDDRCNMKHOTICKYEPEINPTAPVEK 
PYLTNQPGDTHQNWVTEAGIIPNLIYWIPTIPLLLLILVAFGTCCFQMLHKSKGRTKTSPNQ 
STLWISKSTRKESGMEV 

Important features of the protein: 
Signal peptide: 

amino acids 1-21 

Transmembrane domain: 

amino acids 214-235 

N-glycosylation sites. 

amino acids 86-89, 255-258 

cAMP- and cGMP-dependent protein kinase phosphorylation site. 

amino acids 266-269 



iai N-myristoylation sites. 

amino acids 27-32, 66-71, 91-96, 93-98, 102-107, 109-114, 140- 
145, 212-217 
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